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ABSTRACT 

We analyze the emission properties of a new sample of 3 579 type 1 AGN, selected from 
the SDSS DR7 based on the detection of broad Ha emission. The sample extends over a broad 
Ha luminosity LbHa of lO"*" - 10'*^ erg s"' and abroad Ha FWHM of 1 000 - 25 000 km s^^ 
which covers the range of black hole mass 10^ < Mbh/M;) < 10^^ and luminosity in Edding- 
ton units 10"^ < L/Ledd < 1 • We combine ROSAT, GALEX and 2MASS observations to fomi 
the SED from 2.2 ^m to 2 keV. We find the following: 1. The distribution of the Ha FWHM 
values is independent of luminosity. 2. The observed mean optical-UV SED is well matched 
by a fixed shape SED of luminous quasars, which scales linearly with LbHa, and a host galaxy 
contribution. 3. The host galaxy r-band (fibre) luminosity function follows well the luminos- 
ity function of inactive non-emission line galaxies (NEG), consistent with a fixed fraction of 
'^ 3% of NEG hosting an AGN, regardless of the host luminosity. 4. The hosts of lower lumi- 
nosity AGN have a mean z band luminosity and u~z colour which are identical to NEG with 
the same redshift distribution. With increasing LbHa the AGN hosts become bluer and less 
luminous than NEG. The implied increasing star formation rate with LbHa is consistent with 
the relation for SDSS type 2 AGN of similar bolometric luminosity. 5. The optical-UV SED 
of the more luminous AGN shows a small dispersion, consistent with dust reddening of a blue 
SED, as expected for thermal thin accretion disc emission. 6. There is a rather tight relation 
of vLv(2keV) and LbHa, which provides a useful probe for unobscured (true) type 2 AGN. 7. 
The primary parameter which drives the X-ray to UV emission ratio is the luminosity, rather 
than Mbh or L/Ledd- 
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1 INTRODUCTION 

The first systematic study, based on optical spectroscopy, of a com- 
plete and well defined sample of Broad Line AGN was conducted 
by Boroson & Green (1992, hereafter BG92). It included 87 z < 0.5 
AGN from the Bright Quasar Survey (BQS, Green et al. 1986), 
with spectra taken over the 4100A-5900A range, at a spectral res- 
olution of --^ 700. This survey, and additional systematic studies at 
other wavelengths, led to major new understandings of the emission 
and absorption properties of AGN (BG92, and citations thereafter). 
The BQS survey was followed by the Large Bright Quasar Survey, 
which yielded spectra of 1055 quasars (Hewett et al. 1995), and 
by the 2dF QSO Redshift Survey which yielded spectra for 23 338 
quasars (Groom et al. 2004). The next major step in AGN opti- 
cal spectroscopy came with the Sloan Digital Sky Survey (SDSS) 
Quasar Catalog (latest release QCV, Schneider et al. 2010), which 
produced a sample of quasars a factor of 1000 larger, compared to 
BG92, with a factor of 10 more spectroscopic data per object (a fac- 
tor of three larger wavelength coverage and a factor of three larger 
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spectral resolution). This sample is now a prime resource for stud- 
ies of AGN (see Schneider et al. 2002, and citations thereafter). The 
QCV sample is mainly based on objects selected for spectroscopy 
due to their non-stellar colours, supplemented by SDSS spectra se- 
lected based on other surveys. The QCV sample includes luminous 
objects (M[ < —22.0) which exhibit at least one emission line with 
FWHM> lOOOkms^l. 

Various studies used the SDSS to produce samples of lower lu- 
minosity AGN. At Mi > -22.0 the AGN are generally dominated 
by the host light, which needs to be subtracted to detect the AGN 
continuum and broad line emission. Hao et al. (2005a), who used 
the SDSS second data release (DR2), subtracted the host light based 
on a principal component analysis method. They then modeled the 
Ha profile using one or two Gaussians, and derived the FWHM of 
the broader component. Their type 1 AGN sample includes a total 
of 1317 objects with FWHM > 1200 km s^', and was used to study 
the extension of the luminosity function of type 1 AGN to low lu- 
minosities. The later study of Vanden Berk et al. (2006), based on 
DR3, extended the earlier SDSS quasar sample (Schneider et al. 
2005) to lower luminosity by removing the absolute magnitude cri- 
terion, which yielded 4666 low-luminosity type 1 AGN, used to 
study the relation between the AGN luminosity and the host prop- 
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erties. The definition of a type 1 AGN in Vanden Berk et al. (2006), 
and also in QC V, requires the detection of at least one emission line 
with FWHM > 1000 km s^'. This criterion works well for lumi- 
nous AGN, as the emission line width is dominated by the Broad 
Line Region (BLR) in type 1 AGN, and it excludes type 2 AGN 
where the emission lines are from the Narrow Line Region (NLR), 
where the FWHM < 1000 km s^'. However, as we show below 
lower luminosity type 1 AGN show an increasing relative contri- 
bution from the NLR, and would fail the FWHM > 1000 km s"' 
criterion. Thus, the Vanden Berk et al. (2006) sample becomes in- 
creasingly incomplete with decreasing AGN luminosity. The latest 
SDSS sample of low luminosity type 1 AGN is of Green & Ho 
(2007, hereafter GH07), which is based on DR4. This sample is 
based on the detection of a broad component for the Ha emission 
line in the 6400A — 6700A wavelength region. It includes 8435 ob- 
jects, and was used to study the AGN Black Hole mass (Mbh) func- 
tion. We note that broad Ha with FWHM > 10,000 km s"' are 
selected against in the GH07 sample, due to their restricted search 
region. 

Here we present a new SDSS based sample of type 1 AGN 
which extends to low luminosity, with the purpose of providing a 
useful and well defined database which extends the QCV sample 
to low luminosity, using DR7 (Abazajian et al. 2009). We combine 
this data set with observations in the UV, IR, and X-ray, based on 
GALEX (Martin et al. 2005), 2MASS (Skrutskie et al. 2006) and 
ROSAT (Voges et al. 1999, 2000), in order to study the NIR to X- 
ray Spectral Energy Distribution (SED) of low luminosity AGN, 
and its dependence on various AGN properties. We also derive the 
mean properties of low luminosity AGN host galaxies, and com- 
pare them to the mean properties of inactive galaxies. In following 
papers we study some of the emission line properties of this sample. 

The first study of the IR to X-ray SED of a complete and well 
defined sample of AGN was made by Sanders et al. (1989) for the 
BQS optically selected quasar sample. It showed the overall sim- 
ilarity of the SED of AGN, composed of an IR bump, an optical- 
UV bump, and an X-ray power-law continuum. A follow up study 
was made by Richards et al. (2006) for a small subsample of 259 
quasars from the SDSS, which determined the wavelength depen- 
dence of the bolometric coiTection factor. However, little is known 
about the dependence of the AGN SED on luminosity, Mbh> and 
luminosity in Eddington units L/LEdd (Vasudevan & Fabian 2009; 
Grupe et al. 2010). This dependence may provide important clues 
to the nature of the accretion process (e.g. Davis & Laor 20 1 1 ). 

Here we provide an order of magnitude larger sample of type 1 
AGN with nearly complete detections in most bands. These allow 
us to form an unbiased SED of the sample which extends to low 
luminosity AGN, and to study its relation with various AGN prop- 
erties. A significant limitation is that the SED of low luminosity 
AGN is often dominated by the host galaxy emission. High angular 
resolution, not available here, is required to isolate the low lumi- 
nosity AGN emission from the host galaxy light (e.g. Maoz et al. 
2005). 

In addition, we explore how broad the broad Ha gets. Laor 
(2003) noted that type I AGN with BLR FWHM> 25000 km s"' 
are extremely rare. Various theoretical models of the BLR and the 
ionizing continuum predict an upper limit on the FWHM, beyond 
which the BLR does not exist (Nicastro 2000, Elitzur and Shlosman 
2006, Laor & Davis 201 1). The models differ in the dependence of 
the maximal FWHM on luminosity. Here, we use the new sample to 
explore the FWHM distribution and its dependence on luminosity. 

The paper is organized as follows. In §2.1 - §2.5 we describe 
our type I sample selection. In §2.6 we explore the selection ef- 



fects which arise from our detection procedure, and in §2.7 we add 
photometry from IR, UV, and X-ray surveys. In §3.1 the distribu- 
tion of the type I sample in various emission properties is explored, 
and compared to previous AGN samples, and to samples of inactive 
galaxies. In §3.2 - §3.4, we decompose the mean observed NIR - 
UV SED into the net-AGN and host SEDs. The implied host prop- 
erties are examined in §3.5. In §3.6 we explore the relation between 
vLv(X-ray) and Lbjja- The origin of the SED dispersion around the 
mean SED is explored in §3.7. In §3.8 - §3.9 we investigate the de- 
pendence of the AGN SED on Mbh and L/L^dd- We discuss our 
results in §4, and summarize them in §5. 

Throughout the paper, we assume a FRW cosmology with SI 
= 0.3, A = 0.7 and /fo = 70 km s"' Mpc"'. 



2 SAMPLE CREATION 

2.1 parent sample 

Our puipose is to derive a complete and well defined sample of low 
luminosity broad line AGN, based on the SDSS DR7 (Abazajian 
et al. 2009). The SDSS obtained imaging of a quarter of the sky in 
five bands (ugriz; Fukugita et al. 1996) with a CCD mosaic camera 
(Gunn et al. 1998) coupled to a 2.5-m telescope. The 95% com- 
pleteness limit in the r band is 22.2 mag. Spectroscopic follow-ups 
of the photometric catalog were performed using spectrographs fed 
by 3"-aperture fibres. Targets were selected for spectroscopic fol- 
lowup if one of the following criteria applied to the photometry: 
an extended morphology, which creates the ~ 10 objects of the 
main galaxy sample (Strauss et al. 2002); non-stellar colours or a 
nearby FIRST (Becker et al. 1995) source, which is the base of 
the ~ 10 objects of QCV. Based on the availability of fibers, tar- 
gets with a nearby ROSAT (Voges et al. 1999, 2000) source might 
have also been selected for spectroscopy. The spectrographs cover 
the wavelength range between 3 800 A and 9200A at a resolution 
of ~ 150 km s~', with a typical S/N of 10 pixel^' for a galaxy 
near the flux limit. The flux of the spectra is calibrated by matching 
the spectra of simultaneously observed standard stars to their PSF 
magnitude (Adelman-McCarthy et al. 2008). On each spectrum, the 
SDSS pipeline measures the emission lines, and then determines 
the redshift z and the classification of the spectrum, according to 
the locations of emission lines, or according to correlations with 
template spectra. 

We apply several initial filters to the DR7 database: 

(i) The spectrum is classified as non-stellar. 

(ii) z < 0.31, to ensure the continuum level can be measured at least 
20000 km s^ to the red away from Ha, so that very broad Ha 
lines with FWHM < 25000 km s"' are detectable. 

(iii) z > 0.005, to ensure reliable distance estimates based on the as- 
sumed Ho, without significant errors due to peculiar motions. 

(iv) A S/N > 10 in the spectrun^J, to improve the accuracy of the 

NLR/BLR decomposition. 
(v) A percentage of pixels near Ha indicated as 'bad' in the pixel 
mask (Stoughton et al. 2002) which is no more than 0%, 20% or 
50%, depending on the distance from Ha (see appendix Al). The 
main source of bad pixels is poor sky subtraction, which degrades 
the spectrum mainly at 8OOOA < X. 



' Based on median S/N of spectram pixels covered by the g, r, i bands 
(available in the SDSS database). 



A total of 232 837 spectra, out of the 1.6 million spectra in 
DR7, pass these five criteria. We dub them as the 'parent sample'. 
The spectra are then corrected for foreground dust, using the maps 
of Schlegel et al. (1998) and the extinction law of Cardelli et al. 
(1989). We now need to determine which of these objects host a 
broad line AGN. 



2.2 Host subtraction 

The rather large 3" fibre aperture of the SDSS spectra implies that 
a significant fraction of the host light is typically included, which 
needs to be removed in order to identify low luminosity AGN. As 
discussed below (§2.3), the AGN are identified through the detec- 
tion of a broad Ha component. To detect this component we need to 
subtract the broad stellar absorption features, and derive a smooth 
continuum. In particular, in the vicinity of the Ha feature, there are 
flanking TiO bands (Bica & Alloin 1986, and see appendix A2), 
which need to be properly subtracted off. 

To roughly model the absorption features, we follow an ap- 
proach similar in principle to that described in Hao et al. (2005a). 
We utilize the galaxy eigenspectra (ESa) presented in Yip et al. 
(2004; hereafter Yip04), which were derived from a principle com- 
ponent analysis of SDSS galaxies. A linear sum of the first three 
ESa and a power law component with L^ oc A^'-^, representing 
the AGN continuum, is fit to wavelength regions which lack the 
strongest galaxy and AGN emission lines (see appendix A2). Note 
that the four components of the fit are non-orthogonal, in contrast 
with the original ESa of Yip04. 

In practice, light emitted by hot stars near Ha is degenerate 
with the power-law component, as can be seen in the third ES of 
Yip04 (fig. 5 there). This degeneracy produces an unrealistic re- 
sult that the mean fraction of continuum emission emitted by AGN 
in all galaxies is ~ 5%. In the more thorough PCA decomposi- 
tion performed by Vanden Berk et al. (2006), in which they use 
also eigenvectors derived from a sample of quasars, they found that 
host- AGN separation is unreliable if the contribution of either com- 
ponent is < 10%. This problem probably also arises from the non- 
orthogonality of the base spectra, as we find here. Note that if one 
excludes objects in which one of the components is weak, a correla- 
tion between the host and AGN luminosity is introduced. To avoid 
these complications and uncertainties, we use our PCA results only 
to subtract the stellar features and search for a broad Ha, and not 
to derive the host and AGN emission. 

The linear sum of the three ESa is subtracted from the 
SDSS spectra in the wavelength ranges 4800A-5250A and 6125A- 
7000A, to derive a featureless continuum near the Ha, H/3 and 
[O III] lines. Note that the Yip04 ESa include naiTow emission 
lines from H 11 regions, with typical FWHM of ~ 250 km s^', 
which should be avoided. We interpolate over all the strong nar- 
row features in the ESa prior to their subtraction (note the narrow 
lines do not participate in the fit). Since H/3 in ES 1 and 2, and 
Ha in ES 1, reside in wider stellar absorption features (FWHM 
^ 1300 km s^'), there may be additional naiTower stellar absorp- 
tion features, which are lost by the interpolation, and not properly 
subtracted off with the other stellar absorption features. The possi- 
ble amplitude of this effect is small, and is discussed in following 
papers on the narrow emission line properties. 

In 1% of the spectra in the parent sample only poor fits (re- 
duced X > 2) are obtained. The algorithm described below might 
falsely detect a broad Ha feature when the fit is poor, and such 
false detections are filtered in the final stage of the pipeline (§2.5). 
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2.3 Candidate broad Ha objects 

A broad emission line is a unique signature of AGN emissiorQ We 
search for a broad Ha line, as Ha is the strongest AGN line in the 
visible region. Identifying AGN through the detection of a broad 
Ha line is easier than the detection of a power-law component, 
as the average flux density of the broad Ha feature is ~ 5 times 
larger than the power-law continuum flux density (Fig. 1). Also, as 
discussed above, the power-law component is degenerate with the 
continuum of young stars. To identify a broad Ha line we search 
for excess flux above the continuum in the vicinity of Ha, which 
cannot be attributed to be part of the narrow emission line profiles. 

After subtracting the stellar emission, the featureless contin- 
uum is derived by linearly interpolating the mean continuum level 
at the continuum windows at 6125A-6250A and 6880A-7000A, 
which are ~ 300A, or ~ 14000 km s^' away from Ha. The con- 
tinuum is subtracted, and a broad Ha is looked for in the residual 
spectrum. The residual flux at 6250A-6880A is summed, excluding 
regions 20-pixel wide (I 380 km s^') centred on the [O I], [S II], 
[N II] and Ha narrow emission lines. Also, we exclude excess flux 
detected away from Ha, with a region at least ten pixels wide with 
null mean residual flux, as it is implausible that such distinct emis- 
sion features originate from a broad Ha emission feature. Bad pix- 
els are interpolated over. The total excess flux is denoted by AF. 

The significance of AF is assessed by comparing it to (Tdatai 
the dispersion in the flux density at 6I25A-6250A and 688OA- 
7OOOA, which incorporates both the S/N of the spectrum and resid- 
ual stellar features which were not fully subtracted in §2.2. A total 
of 6986 (3%) objects with AF/cTjatg > 2.5, from the parent sam- 
ple of 232 837 objects, are found as potential broad-line AGN, and 
were passed on to the next stage of the pipeline. A depiction of 
this process, and the reasoning behind the 2.5 threshold, appear in 
appendix A3. 



2.4 Broad Ha and narrow lines fit 

The purpose of this stage is to fit the excess flux near Ha by a 
broad line profile. We measure the profiles of various forbidden 
lines, [O III] A5007, [O I] 116300,6363, [N 11] 116548,6583 and 
[S II] 116716,6730, which provide the narrow lines profile, and 
help us separate the broad and narrow components of Ha. We use 
4*-order Gauss-Hermite functions (GHs; van der Marel & Franx 
1993) to fit the naiTow line profiles, and an up to 10*-order GH to 
fit the broad Ha profile, which can have a complex shape, such as 
a double peak. The Levenberg-Marquardt best-fit algorithm (Press 
et al. 1992) is used in all fits. The fit is used to measure the lumi- 
nosities and mean velocities of all the lines mentioned above, and 
the higher moments are measured only for the broad Ha and for 
[O III]. 

An overview of the fitting process is given below, and further 
details appear in appendix A4. The narrow line results are used in 
following papers, so a more elaborate analysis of their fit procedure 
and its implications is given there. 

The wavelength region near [O III], 4967A-5250A, is fit with 
three components: an iron template derived from observations of 
I Zw 1 (kindly provided by T. Boroson), a GH fit for the [G III] line, 
and a continuum component. The iron template can broaden and 



^ Excluding the rai'e examples of broad Ha due to supemovae (see figures 
13 & 14 in Filippenko 1997). Only a handful of such objects reached the 
final sample (§2.5). 
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shift during the fit. Following that, all the lines between 6250A- 
688OA noted above are fit, using a 4*-order GH for the broad Ha. 
To lower the degrees of freedom, the widths and higher moments of 
all the narrow lines in this region are kept equal. In the cases where 
all these narrow lines are too weak to significantly constrain their fit 
(14% of final sample, see §2.5), we tie their widths and higher mo- 
ments to that of [O III], which is detected in 99% of the objects in 
the final sample. Then, the broad Ha is fit with increasingly higher 
order GHs, until an acceptable fit is achieved. Finally, the narrow 
Hj3 line is also fit, keeping its centroid, width and shape equal to 
that of the narrow Ha. Fitting the broad H/3 profile is beyond the 
scope of this paper. 

Following the above process, 3% of the objects have an un- 
acceptable reduced ;i;^ > 2 in the 6250A-6880A region. The treat- 
ment of these objects is detailed in appendix A4. 

Figure 1 shows various examples of fits. The three upper pan- 
els show objects with a range of broad to narrow Ha peak flux den- 
sity ratio. The upper panel shows a 'standard' Seyfert 1.0, where 
the broad Ha dominates, with a flux density ratio at the line core 
of ^A;bHa/^A;nHa — 3- Note the extended wings of the broad Ha 
profile, beyond a simple Gaussian, which are fit by higher order 
GH components. The second panel shows a Seyfert 1.5 (Oster- 
brock 1981), where the narrow Ha peak is higher than the broad 
Ha peakF;t.t,Ha/^A;nHa - 0-5- The FWHM of the total Ha profile 
is dominated by the narrow component. Such an object is therefore 
excluded from QCV and the Vanden Berk et al. (2006) sample of 
broad line AGN, although it is clearly a broad line AGN. The third 
panel shows a Seyfert 1.9 (^A;bHa/^A;nHc( — 0.05), where a very 
broad Ha is still detectable. The broad Hj8 is unobservable, as ex- 
pected. Such an object will be defined as a Seyfert 2, if only H/3 is 
available, although the EW of Ha (before the host subtraction) is 
28A, which may indicate this is actually an unobscured broad line 
AGN. A full analysis of narrow to broad flux ratios and their de- 
pendence on AGN characteristics is deferred to a following paper. 

The bottom panel is an example of an object with no clear 
transition between the B aimer narrow and broad line profiles. Ad- 
ditionally, all narrow lines near Ha are undetectable. In such ob- 
jects, we use the [O III] 15007 profile as a template profile for the 
narrow Balmer lines. 



2.5 Final selection criteria 

The purpose of this stage is to determine if the excess flux 
found above in 6 986 objects is produced by a broad Ha. As the 
AF/Odata = 2.5 threshold is chosen to minimize the rejection of 
broad line AGN (appendix A3), many objects with larger AF/Ctjata 
lack a clear broad Ha feature. To filter out such cases, we employ 
the additional following selection criteria: 

(i) A line width 1 < Aviooo < 25, where Aviooo = Av/1000 and Av is 
the Ha FWHM in km s^ ' . The upper limit filters out cases where a 
very broad GH component essentially fits the continuum. The lower 
limit filters out a very narrow GH component which fits the narrow 
Ha profile. 

(ii) A normalized excess flux density at the line centre Fji.bea/^A > 2, 
where ex is the SDSS flux density error. This ensures significant 
flux density excess at the expected centre of Ha, in addition to the 
above requirement of significant total flux. It filters out significant 
excess flux placed away from the position of Ha. Also, it excludes 
broad and very shallow emission features produced when the con- 
tinuum is not properly placed (e.g. due to residual stellar absorption 
features, §2.2). 
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Figure 1. Examples of the range of Ha and H/3 profiles observed in the 
host-subtracted spectra. The black lines are the fits to the broad Ha (8*- 
10* order GH functions) and to the narrow lines (4"'-order GHs). Dotted 
lines mark the FWHM of the Broad Ha. (a) Seyfert 1.0, Fi;bHa/FA;nHa - 
3: note the extended wings of the BL profile compared to a Gaussian, fit 
by high-order GH components, (b) Seyfert 1.5, FA;bHa/Fl;nHa — 0-5: the 
FWHM of the total Ha profile is dominated by the narrow Ha, therefore 
this object is excluded from the SDSS quasar catalog and the Vanden Berk 
et al. (2006) sample, (c) Seyfert 1.9, FA;bHa/Fl;nHa — 0.05: such an object 
will be defined as a Seyfert 2 if only H/3 is available, despite an EWbUa = 
28A, which may indicate this is an unobscured broad line AGN. (d) A type 
1 AGN with no clear transition in the profile between Fx;bHa ^nd F^.^jja- 
The Balmer NL profiles are based on [O III], since [N II], [S II] and [O I] 
are undetectable. 



(iii) A revised excess flux criterion AF'/ajj^ta > 4, rather than 
AF/a(jata > 2.5 used earlier (§2.3), where AF' is obtained from 
the broad Ha profile fit. Flux at A less than 3(T away from the peak 
positions of the narrow lines is excluded in the summation of AF' 
(using the (T and peak fit to the narrow lines). This filters out ob- 
jects in which the original excess flux may come from a broad base 
of the forbidden lines, rather than from the BLR. 

A total of 3 602 objects with acceptable fits (reduced x^ < 2) 
pass these criteria. The selection effects produced by the above cri- 
teria are discussed below. To verify the reality of the broad Ha 
detection, we also inspected by eye all the 3 602 spectra, and ex- 
cluded further 93 objects where there was no clear BLR, i.e a false- 
positives rate of ~ 3% for our broad Ha detection algorithm. Also, 
80 objects with high x have an acceptable fit, as described in 
appendix A4. Finally, we inspected by eye the images of all the 
z < 0.03 objects and removed three objects which were offset from 
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the centre of their host galaxies. The size of the final sample, here- 
after the 'Tl sample', is 3 579 objects. 

Table 1 lists the broad Ha luminosity (LbH^), and Aviooo for 
each object in the Tl sample. Our rejection criteria may have been 
too restrictive, and potentially excluded objects with an unusual 
broad line profile. We therefore provide in appendix B a table which 
lists the other 3 407 excess flux objects that were rejected from the 
Tl sample, along with the reason for rejection. This list may be 
useful for future studies. 



2.6 Selection effects 

Since the Tl sample is selected based on the detection of a broad 
Ha, it is important to understand how the sample selection crite- 
ria (§2.3 and §2.5) affect the range of detectable line widths, as a 
function of line flux. 

Figure 2 presents three panels with the distribution of the T 1 
sample objects in the broad Ha flux (FbHce) vs. Aviooo plane. In 
each panel the coloured objects mark the 4% of the objects which 
are just above a given selection threshold. The top panel marks the 
positions of 4% of the objects where 2.5 < AF/a^ata < 6.4. The 
selection of AF/<T(jata > 2.5 affects the lowest detectable Aviooo- As 
FbHa decreases below 10^ erg s^ cm^ , the minimal detectable 
Aviooo increases from 1 to 3. As the excess flux becomes lower, the 
line needs to be broader, otherwise it is lost in the glare of the NLR 
Ha component. Selection criterion (iii) in §2.5 of AF'/c7data > 4 
(not plotted), produces a similar effect. 

The middle panel marks the positions of objects where 2 < 
Px bHa/^A < 3-6- This selection produces a reverse effect. As FbHce 
decreases below 10^ erg s^' cm^ , the maximum detectable 
Aviooo decreases from 10 to 3. As the excess flux density becomes 
lower, the line needs to be naiTower, otherwise it is lost in the con- 
tinuum noise. 

The lower panel marks the positions of objects with 10 < 
S/N < 17.1. Clearly, this selection criterion does not form any of 
the boundaries in the F(,Ha vs. Aviooo plane. Of the 93 objects re- 
moved by eye (§2.5), 85% have FbHa and Aviooo values which co- 
incide with the 2.5 < AF/c7(jata < 6.4 objects (top panel), indicat- 
ing the manual filtering does not introduce an additional boundary 
in this plane. 

A similar analysis in the EWbHa vs. Aviooo plane gives similar 
results. The minimum detectable Av rises from 1 to 3 for EWbHa < 
40A, and the maximum detectable Aviooo decreases from 10 to 3 
for EWbHa <20A. 

The objects in Fig. 2 extend towards Avjooo = 1, in particu- 
lar at FbHa < lO^'-''^ erg s^' cm^^. Thus, the selection criterion 
Aviooo > 1 rnay exclude some type 1 AGN which have Aviooo < 1- 
What is the possible missed fraction of AGN? We have excluded 
149 (= 3579 X 4%) objects that were fit with < Aviooo < 1, due 
to possible confusion with emission from [N II], some of which 
may be true broad line AGN with Aviooo < 1. The expected frac- 
tion of Aviooo < 1 AGN can be also estimated from the GH07 sam- 
ple, where also 3% of the objects have Aviooo < 1- Since a primary 
motivation of GH07 was to find broad line AGN with the lowest 
Aviooo (low Mbh, see Greene & Ho 2004), it is likely to be as com- 
plete as possible, and implies the Tl sample likely misses the 3% 
of AGN in the SDSS with Aviooo < 1 > which may constitute part of 
the objects with an apparent Aviooo < 1 excluded here. 

The Aviooo < 25 selection criterion has practically no effect 



as very few objects reside at Aviooo > 10 (only one object with 
Aviooo > 25 is detected her4_|). 



2.7 Additional surveys 

We supplement the optical SDSS spectra in the Tl sample with 
photometric measurements in the UV, Near IR and X-ray. In order 
to include all AGN light, and to be consistent with the luminosity 
calibration of SDSS spectra (Adelman-McCarthy et al. 2008), we 
use luminosities from fixed apertures (different for each survey), 
corrected for the expected full-to-aperture light ratio of a point 
source. Note that for extended objects, the deduced luminosity is 
different than the luminosity obtained by a PSF fit to the image. 
This procedure is preferred over using galaxy-model fits, which 
perform poorly and differently for different AGN/host light ratios 
(e.g. §4.4.5 in Stoughton et al. 2002). Thus, comparing the AGN 
luminosity of our Tl objects to PSF luminosities of distant quasars 
(§3.3) is justified. Host luminosities are compared only with ob- 
jects that were observed in the same way (§3.3 - §3.5). The specific 
details of each survey are described below, and the photometric lu- 
minosities of each object are listed in Table 1. 



2.7.1 GALEX 

The GALEX mission (Martin et al. 2005) imaged 2/3 of the sky 
with two broadband filters: far-UV (FUV; effective wavelength 
I528A) and near-UV (NUV; 2271 A). We use the GR6 data release, 
described in Morrissey et al. (2007) and in the MAST websitqj. 

We search for GALEX detections in both bands, within 5" of 
the Tl objects. In the case of multiple GALEX detections per ob- 
ject, we co-add the observations, weighted by exposure time. Fig- 
ure 3 presents the detection fractions of the 89% of the Tl objects 
observed by GALEX, as a function of broad Ha flux (FbHa) for 
point sources and for extended sources. The point sources in the 
sample are effectively complete in the UV. A trend of decreasing 
detection fraction with decreasing FbHa can be seen in the extended 
objects, but the detected fraction remains > 82%. If we include 
NUV-only detections, 98% of the Tl objects observed by GALEX 
are detected. 

We use the UV fluxes of the 6"-radius 'aperture 4', so that 
the residual correction for the PSF is < 10% (Morrissey et al. 
2007). We correct for the PSF and for Galactic reddening using 
'4nuv/F(S -V) = 8.2 and Afuv/F(S - V) = 8.24 (Wyder et al. 
2007). When a specific rest-frame UV wavelength is used, we in- 
terpolate (or extrapolate) a power-law from the measurements of 
the two bands. 



2.7.2 2MASS 

Near infrared photometric information was obtained from the point 
source catalog of the Two Micron All Sky Survey (2MASS; Skrut- 
skie et al. 2006), which imaged 99.998% of the sky in the J (1.25 
/im), H (1.65 ^m), andKs (2.16 /xm) bands to a sensitivity limit of 
15 mag in the H-band (for point sources with S/N ~ 10). We use 
the published luminosity from a 4"-radius aperture, fixed for the 
residual of the PSF. 

For each object in the Tl sample, we search for a 2MASS 
counterpart using a matching radius of 2". We filter 2MASS objects 



' SDSSJ094215.12-I-090015.) 
* http://galex.stsci.edu/GR6 



see Wang et al. (2005). 
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SDSS Name 


7 


ibHa 


Av 


Ks 


H 


J 


NUV 


FUV 


2keV 


J0()0202.95-103037.9 


0.103 


41.91 


2320 


44.13 


44.19 


44.23 


43.83 


43.83 


42.58 


J000338.94+16022().6 


0.116 


42.19 


2080 


44.14 


44.23 


44.16 


43.59 


43.49 


42.49 


J000410.80-104527.2 


0.240 


42.63 


1370 


44.55 


44.63 


44.47 


44.65 


44.69 


43.22 


J00061 1.55+145357.2 


0.119 


42.12 


3300 


44.18 


44.24 


44.30 


43.94 


44.03 


42.60 


J000614.36-010847.2 


0.090 


41.60 


3960 


43.77 


43.86 


43.89 


43.30 


43.26 


0. 



Table 1. The broad Ha luminosities (in log erg s"'), widths (in km s"') and 2MASS, GALEX and ROSAT vLy luminosities (in log erg s"') of the Tl 
sample. GALEX and 2MASS luminosities are in observed frame. Non detections are denoted by '0'. For 2MASS and GALEX, a detection in all bands of the 
survey is required. An object that was not observed is denoted by '-1' (GALEX only). The electronic version of the paper includes all 3 579 objects. 



with a risk of nearby star contribution ('ccflag' = s,p; Cutri et al. 
2003) or an upper limit in either of the three bands. We find that 
96.5% of the Tl sample have such matches (Fig. 3). We correct 
for Galactic reddening using the A/E(B — V) values in Schlegel 
et al. (1998). When a specific rest-frame wavelength in the NIR is 
needed, a power-law is interpolated (or extrapolated) from the two 
nearest 2MASS bands. 



2.7.3 ROSAT 

The ROSAT All-Sky Survey (RASS; Voges et al. 1999, 2000) cov- 
ers the entire celestial sphere in the 0.1 — 2.4 keV range with the 
Position Sensitive Proportional Counter (Pfeffermann et al. 1987) 
to a typical limiting sensitivity of ~ 10^ erg s^' cm^ . Typical 
positional uncertainties are 10" — 30", and the detected source den- 



sity is ~ 3 deg^ . Only 0.4% of the Tl sample was targeted for 
spectroscopy by the SDSS solely for being near a ROSAT source, 
and not as part of the complete SDSS Galaxy and Quasar surveys 
(see §2. 1 and Stoughton et al. 2002). Therefore, we do not expect 
the Tl sample to be biased towards X-ray bright AGN. 

We search for a RASS source within 50" of each Tl object, 
expecting 7 objects to have false matches. We find matches to 1 545 
(43%) of the Tl sample. The detection fraction vs. Fbna is shown in 
Figure 3. The strong trend of detection rate with F^Ha implies that 
most of the objects without an X-ray detection are not a distinct 
class ('X-ray weak'; Laor et al. 1997), but are rather AGN faint in 
all bands. 

We use the PIMMS software to convert RASS count rates into 
X-ray fluxes, assuming a power-law X-ray spectrum with oix = 1 -5 
(Laor et al. 1994, Schartel et al. 1996). Then, using the z of the op- 
tical match, cHx = L5, and the Galactic A'h measurements of Stark 
et al. (1992), we derive the rest-frame vLv(2 keV) of the matched 
objects, corrected for Galactic absorption. 



3 RESULTS 

3.1 The final sample 

Figure 4 presents LbHa vs. z of the final Tl sample. The figure dis- 
plays the 3 077 objects at z < 0.2. Empty circles mark the 423 ob- 
jects (14%) that also appear in the QCV sample. The QCV sample 
objects mostly reside at logLbHa ~ 42.5 — 43, while the Tl sample 
extends further down to logLbHa ~ 40. The Tl sample includes ad- 
ditional 502 objects at 0.2 <z< 0.31 not displayed in Fig. 4. In this 
range there is an overlap of 88% of our objects with the QCV sam- 
ple, but our sample includes only 24% of the QCV objects in this 
range. This small fraction is due to our requirement of high quality 
data near Ha, which is satisfied by a decreasing fraction of the ob- 
jects as Ha shifts into the noisy red end of the SDSS spectra (§2. 1). 



The total Tl sample extends over the range logLbHa ~ 40 — 44, and 
becomes distinct from the QCV sample at z < 0.2 . 

Figure 5 compares the ranges of Lbna spanned by the Tl sam- 
ple with the z < 0.35 QCV sample (where Ha is observable). Our 
sample extends down by nearly two orders in magnitude in LbHc(> 
compared to the QCV sample. Specifically, the Tl sample includes 
1 878 objects at Lbea < lO'*^ erg s"', compared to 207 in QCV, 
and 297 objects at L\,iiqi < 10'*' erg s^', compared to 5 in QCV. 
We also show the Ho et al. (1997a) sample (44 objects, updated 
values in Ho 2003), which extends below 10 erg s^ , while the 
Tl sample terminates just below 10^*^ erg s^'. 

In this paper we explore the emission properties of the Tl 
sample as a function of various parameters, including Mbh and 
^/^Edd- We derive these parameters from Ha, following Greene 
& Ho (2005) and Kaspi et al. (2005). We use the relation between 
the BLR radius and UV luminosity in Kaspi et al. (2005), 



Ri 



bHp 



\0.545±0.06 



light days 



(1) 



(17.6 ±0.26) X vLv(1450A)44 

We then use the relation vLv(1450A) = 33 x L^j5„ (eq. 5 below), 
and Greene & Ho (2005, eq. 3 there) to convert FWHM(broad Hj8) 
to FWHM(broad Ha), which gives 

Mbh 



;log 



Mc 



:7.4 + 2.061ogAviooo + 0.5451ogLbH, 



a, 44- 



(2) 



We use LbHa to obtain Lboi using the relation Lboi = 130 x L^'^^ 
(eq. 6 below), which gives 

L 



Z = log 



^Edd 



:0.6-2.061ogAviooo + 0.4551ogLbHa,44 



(3) 



Note that eq. 2 is equivalent to eq. 6 in Greene & Ho (2005), but is 
slightly different as it uses the Kaspi et al. (2005) relation, and the 
relations between LbHa and continuum luminosity obtained below. 

It is important to understand how the sample selection crite- 
ria affect the observed range of various AGN parameters. Figure 
6 shows the observed distribution of the Tl sample objects in the 
^bHa vs. Av plane. In the top panel, the larger (coloured) dots in 
this figure present the objects close to our selection cutoffs (see 
Fig. 2). The m and / diagonal lines are derived from eqs. 2 and 3. 
It is clear that m < 6 objects are heavily selected against due to the 
AF/(Tdata > 2.5 criterion, while the F";L,bHa/£A > 2 criterion selects 
against I < —2.8 objects. 

Also, we can now identify the effect of the rarity of active 
very massive black holes (m > 9) in the local universe, which man- 
ifests as a lack of high-Av, high-LbHa objects. Another limit on the 
distribution is due to the Eddington luminosity, which leads to an 
increasing minimum Av with LbHa for LbHa > 10^^'^ erg s^' . 

The lower panel shows Av as a function of Lboi- Here we add 
the 8 185 QCIV objects from Shen et al. (2008), where the broad 
H/3 is available (z < 0.66). We fix for a mean offset of 0.13 dec 
(with a dispersion of 0. 10 dec) between the Shen et al. FWHM(HjS) 
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Figure 2. The selection criteria effects on the distribution of the Tl objects (small gray dots) in the broad Ha flux vs. FWHM plane. Large coloured markers 
are used to plot the 4% of the Tl objects which are just above the following three selection thresholds. (Upper panel) The AF/fJdata > 2.5 selection criterion. 
At FbHa < 10~ erg s^ cm^~, this selection increases the minimal detectable Avjooo from 1 to 3. (Middle panel) The fA.bHa/^A > 2 selection criterion. 
At i^bHa < 10^''' erg s"' cm^^. this selection decreases the maximal detectable Aviooo from 10 to 3. The (Lower panel) S/N > 10 selection criterion. 
This selection criterion does not form any boundary in the FbHa vs. Aviooo plane. The Tl objects extend down to Aviooo = U in particular at FbHa < 
jQ-13.5 gj.g g-l cm"-, and objects with Aviooo < 1 are excluded (~ 4%, see text). The Aviooo < 25 criterion has practically no effect, as objects with 
Aviooo > 15 are extremely rare. Our FbHa detection limit is lowest for Aviooo ~ 3 objects. 



measurement and our FWHM(Ho!) measurement, derived based on 
common objects. The value of L^\ in the Tl sample is derived 
from eq. 6, and in the Shen et al. sample L\„\ is derived from 
vLv(5100A). The plots shows clearly how the Eddington luminos- 
ity limit extends to quasar luminosities. The Shen et al. FWHM 
measurements extend to higher luminosities using FWHM(Mg II) 
or FWHM(C IV) available for higher z objects. We avoid utiliz- 



ing these measurements since the relation of FWHM(Mg II) to 
FWHM(Hj3) is not linear, and the FWHM(C IV) shows only a very 
weak correlation with FWHM(Mg II) (e.g. Shen et al. 2008, Fine 
et. al. 2010). Note that with increasing L\,o\ the range in the ob- 
served Av shrinks due to the upper limits on / and m. A similar 
effect is seen for Mg II (Fine et. al. 2008), and C IV (Fine et. al. 
2010). In fact, no broad line objects are expected to exist at a high 
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Figure 5. Comparison of LbHa ranges of different broad line AGN sam- 
ples. We include only z < 0.35 objects of the QCV sample, where Ha is 
observable. Although the Tl sample extends down by two orders of magni- 
tude in LbHa. compared to QCV, it does not extend as low as the Ho et al. 



(1997a) sample, which reaches LbHa < 10 
for clarity). 



' erg s (magnified ten-fold 



Figure 3. The detection fractions of the Tl objects in the GALEX (UV, 
blue), 2MASS (NIR, red) and ROSAT (X-ray, black) surveys, as a function 
of Fbua, for extended and point like sources. Detections in all bands of 
each survey is required. Solid/dashed lines represent extended/point source 
objects. The lower panel shows the number of Tl point-like and extended 
objects per 0.5 decade FbHa bin. The GALEX detection fraction is relative 
to the 89% of Tl objects observed. Note the nearly complete 2MASS and 
GALEX detections. When analyzing the X-ray luminosities (§3.6 and §3.9), 
we utilize only FbHa > 10^'^^^ erg s"' cm"- objects, where the detection 
fraction is > 50%. Note that most of the lowest FbHa Tl AGN reside in 
extended objects. Their bright host enables them to pass the SDSS flux limit, 
despite their low FbHa ■ 
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Figure 4. The LbHa versus z distribution for the Tl sample at z < 0.2 (3 077 
objects). Einpty circles mark the 423 objects (14%) that also appear in 
the SDSS quasar catalog (QCV). Note that the QCV sample terminates at 
LbHa of a few lO''- erg s"', while the Tl sample extends further down to 
10"° ergs-'. 



enough L^oi due to the Eddington limit. This merely reflects the 
fact that the minimal required Mbh increases with Lboh and AGN 
above some Mbh just do not exist. 



3.1.1 The Av distribution 

How broad do the broad lines get? The lower panel of Fig. 6 
shows there is a steep decline in the number of objects with Av > 
10000 km s^'. The decline is independent of L^^i, over four or- 
ders of magnitude in Lboj. The dechne does not result from our 
selection criteria, as shown in the upper panel. Figure 7 displays 
histograms of the FWHM distribution cW/dlogAv vs. Av, at four 
luminosity bins. The distributions are remarkably similar, showing 
a roughly linear decline of log (d/V/dlogAv) vs. Av, or equivalently 
dA'/dlogAv °= g-'^^M^o^ with Avq fs 2700 km s'K The distribu- 
tions peaks at Av ~ 2500 — 4000 km s^', and drops steeply for 
smaller values. The sharp drop at the smallest Av, and the increase 
in Av of the peak position with Lbol. is due to the Eddington limit 
(/ = 0, see Fig. 6), which gives a minimal Av oc Ljjj|^ (eq. 3). The 
similar distributions for Av above the peak is surprising, as the Av 
distributions are set by the distribution of the number of objects vs. 
I and vs. m. It is not clear why these two distributions combine to 
yield the same Av distributions at different Lboj. It may imply that 
the distributions of Av is directly set by the value of Av, but it is not 
clear what may be the physical mechanism behind a dependence on 
Av only (see further discussion in §4.1). 



3.1.2 The L/L^dd distribution 

Figure 8 compares the Eddington ratio distributions of the T 1 sam- 
ple with the BQS (using the BG92 z < 0.5 subsample, with values 
taken from Baskin & Laor 2005) and the KoUmeier et al. (2006, 
hereafter K06) quasar samples. Both the BQS and K06 were se- 
lected as point sources, in contrast with the Tl sample, which in- 
cludes both point sources and extended sources. The Tl sample is 
centred around / ~ — 1.3, while the BQS and the K06 samples are 
centred around / ~ —0.6. To determine if the higher / distributions 
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Figure 6. (Upper panel) The distribution of the Tl sample objects in the L(,Ha vs. Av plane. Large coloured markers are as in Fig. 2. Objects with m < 6 or 
/ < —2.8 are selected against. The Eddington limit (/ = 0) sets a minimum Av with increasing LtHa for LtHa > 10^"^ erg s^'. This, together with the rarity 
of AGN with m > 9, leads to a decrease in the range of observed Av values with increasing L^hh . (Lower panel) The distribution of the T 1 sample + QCIV 
sample (values from Shen et al. 2008, see text) in the l^„i vs. Av plane (L|,„i is from eq. 6). The Eddington limit is clearly seen. Note there is a similar steep 
decline in the number of objects with Av > 10000 km s"', at all luminosities. The decline is not set by our selection criteria. 



are driven by the point source selection, we show the / distribution 
of the point sources in our Tl sample. The Tl point sources are also 
shifted to higher / values. This clearly demonstrates that the higher I 
values of BQS and K06 samples are largely driven by the point-like 
selection effect. This trend is well understood by the Mbh versus 
bulge luminosity relation (Magorrian et al. 1998), which implies 
that L/L^dd is equivalent to i-AGN/^bulge- Thus quasars, i.e. objects 



which are selected by their quasi-stellar appearance, are by their 
selection high L/Lgdd AGN. 

Below we compare Tl hosts to other galaxy types (in §3.1.4 
and §3.5), where we use the subsample of 2 766 Tl AGN which are 
part of the SDSS main galaxy sample (Strauss et al. 2002). These 
objects are selected to have extended morphologies, and therefore 
by selection exclude high L/L^dd objects (Fig. 8). 
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Figure 7. The FWHM distribution of the combined Tl + QCIV sample 
(Shen et al. 2008), presented in Fig. 6, at different Ltoi bins. The distiibu- 
tions are similar. The origin of this similarity is not clear. The Av distribu- 
tions should be set by the distribution of m and / values. Either these distri- 
butions somehow lead to a Av distiibution which is independent of L(,oi , or 
it may imply that Av, rather than m and /, sets the observed Av distribution 
through some unknown mechanism. Note the effect of the Eddington limit 
at Aviooo < 3, which increases the minimal Av and the peak position with 
increasing L^\. 
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Figure 8. A comparison of the Eddington ratio distribution in various sam- 
ples. The / bin size is 0.25 decades. Both the BQS and the K06 samples 
were selected to be point sources. The Tl distribution is shifted to signifi- 
cantly lower / values. To test the origin of this shift we plot the distribution 
of the Tl objects originally identified as point like sources, which is clearly 
shifted to higher / values. Therefore, the notion that AGN tend to shine 
at / ~ —0.5 (K06) reflects the point like selection criterion. AGN tenned 
quasi-stellar have a high / by construction. 



3.1.3 Other low luminosity type 1 samples 

Below we compare the completeness of the Tl sample to the other 
SDSS based samples of low luminosity type 1 AGN of Hao et al. 
(2005a), Vanden Berk et al. (2006) and GH07. 

The Vanden Berk et al. (2006) sample is based on the SDSS 
DR3 and includes 4666 objects. The AGN selection criterion is 
the detection of at least one emission line profile with FWHM 
> 1000 km s^'. This criterion excludes AGN where the narrow 



component dominates the Balmer line profiles (e.g. panels 'b' and 
'c' of Fig. 1). As a result, the Vanden Berk et al. (2006) sample 
misses 27% of the Tl objects present in SDSS DR3. 

The Hao et al. (2005a) sample is based on SDSS DR2 and 
includes 1 317 objects, of which 52.5% (692 objects) entered our 
parent sample (passing our S/N> 10 and bad pixel criteria). Hao 
et al. fit the narrow and broad Ha profile with a simpler procedure 
than described in §2, using one or two Gaussians. Of their objects 
that entered our parent sample, 27% were rejected due to a lack of 
broad line flux (§2.3), and further 20% were rejected based on the 
excess flux shape (§2.5). A similar percentage of 47% of the Hao 
et al. (2005a) sample were rejected by GH07. We therefore suspect 
that these broad Ha detections by Hao et al. are not significant. 

The GH07 sample is based on SDSS DR4 and includes 8495 
objects. Our selection method of excess broad flux near Ha is sim- 
ilar to the GH07 selection criterion. The broad Ha detection rates 
in the Tl and GH07 samples are both 1.5%. However, GH07 fit the 
broad Ha feature in a smaller velocity range of —7.4 < vjooo < 6.3 
from the Ha peak position, compared to —14.3 < viooo < 14-5 
used in the Tl sample. Thus, we find that 1.6% of our objects 
(58/3 596) have Avjooo > 10, compared to 0.51% (43/8495) in 
GH07. Also, 17% of the 2 273 GH07 which appear in our parent 
sample, were rejected from the Tl sample, potentially due to more 
strict selection criteria applied here. Comparing the measured Av 
values in individual objects which appear in both samples, we find 
that the Tl values are on average a few percent larger, with a mean 
offset of Av — Avgho? = 222 km s^' . The RMS dispersion of the 
differences in the individual Av values is 875 km s^' , and the RMS 
of the fractional difference in Av is 20% (0.13 for log LbHa)- As 
noted in §2.6, 3% of GH07 have Aviooo < 1> while such objects are 
excluded from the Tl sample. 



3.1.4 The fraction of SDSS objects which are Tl 

It is also interesting to compare the Tl sample to samples of type 
2 AGN and samples of inactive galaxies. We use samples derived 
from the SDSS, to avoid biases due to different selection effects 
of other surveys. We use the classifications of Brinchmann et al. 
(2004, hereafter B04), who classified SDSS galaxy spectra accord- 
ing to the equivalent width and line ratios of narrow emission lines. 
Galaxies with narrow lines were divided according to their posi- 
tions in the BPT diagrams (Baldwin, Phillips & Terlevich 1981; 
Veilleux & Osterbrock 1987). This classification was performed on 
all DR7 spectra in which the galaxy is dominany. Objects which 
are clearly type 1 AGN were excluded, to avoid AGN contami- 
nation of the host galaxy colours. So, the B04 classification in- 
cludes only type 2 AGN. However, as mentioned in Kauffmann 
et al. (2003a), objects with weak broad lines were not excluded. 
Since our Tl sample reaches low LACN/^host values, it can include 
apparently type 2 AGN from B04. We find that out of 23 279 ob- 
jects classified as 'AGN' by B04, 12 340 appear in our parent sam- 
ple (following the S/N > 10 and bad pixel cuts, §2.1). Of these, 
472 (4%) are part of our Tl sample. This fraction is comparable 
with the Kauffmann et al. (2003a) estimate of 8% weak type 1 ob- 
jects in their type 2 SDSS DRl sample, which were not specifically 
identified. 

It is interesting to explore the fraction of Tl objects classified 
by B04 as non AGN. In addition to the 472 Tl objects classified 
as type 2 AGN, 261 Tl objects were classified as composites, 42 



^ Results available at http://www.mpa-garching.mpg.de/SDSS/DR7/ 
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as star forming galaxies (SFG), 20 as LINERs, and one as a non- 
emission line galaxies (NEG). So, a total of 324 objects out of 796, 
or 41%, of low luminosity broad line AGN are not clearly identi- 
fied as AGN based on their narrow lines. In a following paper we 
measure the narrow lines for all the Tl objects, and discuss their 
BPT based identifications. 

Figure 9 presents the classification of SDSS galaxies as a func- 
tion of the spectroscopically measured vLy (6166A), which corre- 
sponds to the SDSS r band (but measured within a 3"-diameter 
fibre). We use all objects in the parent sample (§2.1) which are 
part of the SDSS main galaxy sample (Strauss et al. 2002). The 
Tl classification includes 2766 objects (about 0.1% of the objects 
were not classified neither here nor by B04, and are disregarded). 
For the Tl objects we derive vLv(6166A) from the mean of 20- 
pixels around 6166A. For non-Tl objects we convert the synthetiqj 
r-magnitudes published by B04 to vLy We correct the B04 mea- 
surement by —0.05 dec, which is the mean offset between the two 
methods in common objects. 

The classification fractions in Fig. 9 are somewhat different 
from B04 since some of the classifications depend on the emission 
lines S/N, which are higher on average in the S/N > 10 parent sam- 
ple presented here. Note that SFG heavily dominate (> 90%) at the 
lowest luminosities, and NEG dominate at the highest luminosi- 
ties. The fraction of composite galaxies and type 2 AGN peaks at 
intermediate luminosities. In the Tl sample the luminosity can be 
dominated by the AGN. We therefore remove the AGN contribution 
to vLv(r) using the result presented in §3.4 (eq. 5, 6% of Tl ob- 
jects with an implied negative host are disregarded). The fraction of 
AGN- subtracted Tl objects is marked as 'Tl host'. The error bars 
denote the error in the fraction due to the dispersion of eq. 5 . 

Remarkably, the Tl fraction follows well the NEG fraction, 
or equivalently, their galaxy luminosity distribution is similar. This 
suggests the two populations are related. If NEG are the host galax- 
ies of type 1 AGN (with extended morphologies), then a fixed frac- 
tion of ~ 3% NEG host broad line AGN, at the level detectable in 
this study. Alternatively, if the Tl occur in SFG, then the fraction of 
SFG which host broad line AGN increases sharply from 2 x 10^"' 
to about unity with luminosity. Further results on the Tl hosts are 
given in §3.5, and discussed in §4.3. 



3.2 The IR to X-ray luminosity versus LbHa 

Below we explore the correlation of the continuum luminosity in 
the X-ray, UV, optical, and near IR with L^Ha, which provides a 
measure of the AGN luminosity without host contribution. We di- 
vide the Tl sample into LbHa bins, each half a decade wide (the 
highest bin, LbHa > 10 erg s^' , includes only 14 objects and is 
ignored). Table 2 lists the number of objects per bin and the frac- 
tion of objects with detections in the various bands. The NIR de- 
tection is nearly complete, with a mean detection fraction of 96%. 
The mean UV detection is high (93%), remaining ^ 80% even at 
the lowest LbHa bins. The high detection rates ensures the derived 
mean continuum properties are not significantly biased in the UV 
and in the NIR. The mean X-ray detection fraction is 43%, and 
drops below 50% for LbHa < 42.3 erg s^ . The effects of the X- 
ray band detection rate are discussed in §3.6. A search for matched 
radio detections in the FIRST survey revealed an average detec- 
tion rate of only 18%. We therefore preferred not to explore further 
relations with the radio properties. 
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Figure 9. The distribution of spectral types of 230413 z < 0.2 galaxies in 
the parent sample (§2.1), as a function of the spectroscopy derived lumi- 
nosity (roughly at the r-band). The type 2, SFG, NEG and composite clas- 
sifications (based on the EW of the narrow lines and the BPT diagrams) are 
taken from B04. At low luminosities > 90% of the SDSS galaxies are SFG, 
while at high luminosities the NEG dominate. The Tl curve is based on the 
2 766 extended objects from the Tl sample. The 'Tl host' band is corrected 
for the AGN contribution to the luminosity. The error bars are derived from 
the dispersion in eq. 5. Note that the Tl host luminosity constitute a nearly 
fixed fraction of ~ 3% of the NEG luminosity distribution. This may sug- 
gest that the Tl reside in NEG. The lower panel plots the total number of 
objects per vLy bin of size 0.25 dec. 



Figure 10 shows the correlation of LbHa '^s. vLy in nine rest 
frame bands from the X-ray to the NIR. The 2 keV luminosity is 
from ROSAT, 1528A and 2271 A from GALEX, 3940A, 5100A, 
and 7000A from SDSS, and 1.2;Um, I.7;Um, and 2.2/im (J, H, Ks 
bands) from 2MASS. The optical vLv are measured on 20-pixel 
windows from the SDSS spectrum. The longest rest-frame wave- 
length is chosen so that it is available in all objecto The non- 
optical bands are also in rest wavelengths, calculated from the pho- 
tometric values as described in §2.7. The average and the dispersion 
in log vLv in each LbHa bin are marked in the plot. The ROSAT up- 
per limits are shown in §3.6. The upper limits for the small fraction 
of objects with no GALEX detections are not plotted (it is not ob- 
vious how to derive them from the exposure time). The fraction of 
upper limits for 2MASS is tiny, so we do not plot the upper limits. 

Fig. 10 shows there is a larger dispersion in the UV and X- 
ray luminosities for a given LbHa, compared to the dispersion in 
the optical and NIR bands. This likely reflects the larger variabil- 
ity in these bands. The relations of log vLy vs. log LbHa in the 
different bands clearly do not follow a simple linear (i.e. single 
power-law) relation. Instead of attempting to fit the distributions 
with a more complicated relation, say a double power-law, we 



i.e., the spectrum convolved with the filter pass function. 



Objects with < 6 good pixels in the relevant window are excluded (0.6%, 
2% and 4% in 3940A, 5100A, and 7000A, respectively). 
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Figure 10. The correlation of LbHa vs. observed luminosity from the rest frame X-ray to the NIR (the y-axis values in the X-ray panel are shifted upward by 
1 decade for presentation purpose). The black dots and the associated error bars mark the mean and the dispersion of vLv in each 0.5 decade wide LbHa-bin. 
The dispersions in the UV and X-ray bands are larger than in the optical and NIR bands, probably reflecting the higher variability. The solid lines connect the 
averages of adjacent bins, with the local slopes written underneath. Dashed lines show linear relations, normalized by the highest LbHa-bin. In the luminous 
bins in the FUV, NUV and 3940A panels, the slopes are consistent with a linear relation, in all other bands and luminosities the slopes are flatter (except the 
X-ray panel at the lowest LbHa bins). There is a transition in all bands, excluding X-ray, from a steeper slope to a flatter slope with decreasing LbHa- The 
transition LbHa increases with wavelength, up to a maximum of ~ 10**' erg s"' at I.2;Um. As shown below (Fig. 11), the transition occurs due to the host 
contribution, which peaks at 1.2;Um. The 510()A panel also presents the LbHa — Lagn(5100A) relation from Greene & Ho (2005), derived for a sample selected 
to minimize the host contribution. Objects that appear in the BQS sample are marked by a cross. Note that for X ^ 5100A the BQS objects are near the linear 
relation, consistent with their point like selection criterion which excludes significant host contribution. At shorter X the BQS are above the line, consistent 
with their blue optical colour selection. Note that the EW of Ha, as implied by the 7000A panel, shows an apparent 'inverse Baldwin effect' if the host light 
is not properly subtracted. 
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0.99 


1.00 


0.69 


-0.75 


42.8-43.3 


43.01 


406 


0.205 


0.33 


0.94 


0.98 


0.63 


-1.22 


42.3 - 42.8 


42.54 


662 


0.163 


0.65 


0.95 


0.96 


0.56 


-1.68 


41.8-42.3 


42.04 


859 


0.119 


0.86 


0.97 


0.94 


0.47 


-2.19 


41.3-41.8 


41.56 


853 


0.094 


0.96 


0.97 


0.92 


0.33 


-2.67 


40.8-41.3 


41.09 


488 


0.070 


0.99 


0.99 


0.90 


0.21 


-3.14 


40.3 - 40.8 


40.60 


138 


0.041 


1.00 


0.96 


0.89 


0.20 


-3.62 


39.8-40.3 


40.10 


23 


0.021 


0.96 


0.91 


0.80 


0.22 


-4.12 



Table 2. Detection fractions in the eight LbHa bins. The morphology is taken from the SDSS (Stoughton et al. 2002). The detection fractions are in the 2MASS 
(NIR), GALEX (UV) and ROSAT (X-ray) surveys, and require a detection in all bands of the survey. The R06 scaling is discussed in §3.4. 



just form a 'ridge' line by connecting the averages of the adja- 
cent Z-bHa bins. The slope in each segment (noted in the bottom 
of each panel) is flatter at lower LbHct' excluding the X-ray panel 
(further discussed in §3.6). The change of slope in the FUV oc- 



curs at LbHo 



10' 



41 ^ —1 

' erg s 



The LbHa where the slope change 



occurs increases at longer wavelengths, and reaches a maximum of 
~ 10 erg s^' at 1.2/im. It then decreases back to ~ lO'*^'^ erg s^' 
at 2.2/im. As we show in §3.3, this trend results from the rela- 
tive host contribution at different bands, which peaks at 1.2/xm, 
decreases sharply in the FUV, and nearly disappears in the X-ray. 

The logvLv(FUV) vs. log LbHa slope is close to unity in the 
four highest LbHa bins, i.e. a linear relation between vLv(FUV) 
and LbHa- Since the FUV occurs close to the position of the peak 
SED in AGN (e.g. Zheng et al. 1997), this linear relation suggests 
that LbHa provides a good estimator of Lboi- The simple linear re- 
lation of vLv(FUV) and LbHa suggests two independent linear re- 
lations. A linear relation of vLv(FUV) and the ionizing luminosity, 
as expected if the mean SED of AGN is luminosity independent. 
In addition, it requires a linear relation of LbHa and the ionizing 
luminosity, as expected if the mean covering factor of the BLR is 
also independent of luminosity. 

In each panel of Figure 10 we also plot a linear relation, nor- 
malized to match the highest LbHa-bin. If the AGN SED is indeed 
luminosity independent, than the deviation from linear relation re- 
sults from the host contribution, and thus one can use the deviation 
from the linear relation to estimate the host contribution as a func- 
tion of wavelength. Note that at A Jj 5100A the ridge line slope 
does not approach unity even at the highest LbHa -bin, which may 
suggest the sample does not quite reach the 'pure' AGN emission 
regime (see §3.3). 

Greene & Ho (2005) selected a subset of low z SDSS AGN 
where the host contribution to the optical emission is minimal, 
based on the weakness of the Ca II K stellar absorption feature. 
Their measured vLv(5100A) vs. LbHa relation is plotted in the 
5100A panel of Fig. 10, and indeed passes close to the assumed 
linear relation, normalized at the highest LbHa-bin. This strongly 
supports the host contribution interpretation for the slope changes 
with decreasing luminosity seen in Fig. 10. 



3.2.1 The inverse Baldwin relation for Ha 

We note in passing that the EW of Ha, given to a good approxi- 
mation by LbHa/^^v(7000A), decreases with decreasing luminos- 
ity due to the increasing relative contribution of the host (Fig. 10, 
7000A panel). This corresponds to the 'inverse Baldwin effect' of 
the Balmer lines, noted by Groom et al. (2002) for H/3 and Hy 
in low luminosity AGN found in the 2dF quasar survey. Unlike 
the suggestion that the inverse Baldwin effect of the Balmer lines 



is an intrinsic property of AGN (Groom et al. 2002), our analysis 
suggests the intrinsic Ha EW is luminosity independent, i.e. Ha 
shows no Baldwin effect, as was found for H/3 in more luminous 
AGN (Dietrich et al. 2002; cf. Netzer et al. 2004). 



3.2.2 The vLy vs. LbHa fi"' the BQS sample 

There are 26 objects in the Tl sample that also appear in the BQS 
sample (Neugebauer et al. 1987). These are marked in Fig. 10. The 
BQS selection criteria are of point sources with a relatively blue 
optical colour, in contrast with the Tl selection criteria which are 
independent of morphology and colour It is therefore interesting to 
see how the BQS selection criteria affect their LbHa vs. vLy cor- 
relations. In the A ^ 5100A bands the BQS objects have a lower 
vLv at a given LbHa> and lie closer to the linear relation, in partic- 
ular for the lowest LbHa objects. This offset from the Tl sample is 
consistent with their selection of a weaker host contribution. At the 
X < 5100A bands the BQS objects are offset to higher vLy values. 
The offset increases into the UV, consistent with the optical blue 
colour selection. In the X-ray band the BQS appear to be a ran- 
dom subset of the Tl sample. Note that the BQS sample objects do 
not fall outside the observed Tl objects distributions, but are offset 
towards weaker hosts and stronger UV emission, at a given LbHa- 



3.3 Average SED versus LbHa 

Above we have found indirect indications that 1. the mean SED 
of AGN and the mean covering factor of the BLR are indepen- 
dent of luminosity, 2. the host galaxy contribution flattens the vLv 
vs. LbHa relations with decreasing LbHa- We explore these points 
qualitatively in this section. A quantitative analysis is performed in 
§3-4 - §3-5. Below we form the observed mean AGN SED for the 
eight LbHa bins (Table 2), subtract a mean net AGN SED scaled 
linearly by LbHa^ and compare the residual luminosity to the mean 
SED of a matched sample of inactive galaxies. 

Figure 11, left panel, shows the mean SED in the 2.2/im to 
1528A region for the eight LbHa bins. The integration apertures 
of 2MASS, SDSS, and GALEX are different, thus the fraction of 
the host light included is different in different bands. Furthermore, 
these differences change with z and depend on the host type. Al- 
though the plotted SEDs likely differs from the intrinsic SEDs due 
to these aperture effects, the difference is irrelevant to our purpose 
of understanding the SEDs, as long as all objects used in the anal- 
ysis presented below are similarly affected. 

We cannot use our highest LbHa SED bin as a template net 
AGN SED, as the vLy vs. LbHa relation remains flatter than unity 
at A > 5100A even in that bin, indicating possible significant host 
contribution in those bands. We need a template of more luminous 
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Figure 11. The luminosity dependence of tlie mean SEDs at 2.2;Um-1528A. (Left panel) Tlie mean Tl objects obsei'ved SEDs fortlie eiglit LbHa bins (solid 
lines, see §3.3). Bands from different surveys (different apertures) are connected by a thinner line. Bands with < 70% detection are connected by dotted lines. 
Note the transition from an AGN-dominated UV peaked SED to a galaxy-dominated NIR peaked SED with decreasing LbHa ■ The R06 SED of luminous 
quasars (top solid gray line), is assumed as the net AGN SED. The dashed lines show the expected net AGN contribution, derived by matching the R06 SED 
to the UV in the highest LbHa bin, and rescaling by the relative LbHa of the lower seven bins. The scaled R06 SEDs match the mean SEDs in the UV for the 
2"d_4th i^^^^ bins, as expected from their linear relation (Fig. 10). (Middle panel) Residual SEDs derived by subtracting the scaled R06 SED from the total 
SED (colours match the left panel). For clarity, errors are not shown, though note they are high in the luminous bins, as the residual is a small fraction of the 
total light. The residuals all appear to have a typical galaxy SED. (Right panel) Average SEDs of SDSS inactive galaxies, with ^-distributions matched to the 
z-distributions of the Tl objects in each LbHa bin (mean z noted near each curve). The increase of Lbosi with LbHa. seen in the middle panel, is just a selection 
effect, as inactive galaxies selected at matched z show the same trend, inherent to a flux limited sample. Note the significant excess luminosity in the Kj and 
H-bands in the most luminous residuals (middle panel), implying that a linear scaling of the net AGN SED is applicable only to the optical-UV region. 



AGN, where the host contribution is likely negligible. We therefore 
use Richards et al. (2006, hereafter R06), which provide a mean 
AGN SED constructed using SDSS quasars (with Spitzer detec- 
tions). The R06 template SED is derived from the SDSS photom- 
etry, supplemented with photometry from various other surveys, 
including 2MASS and GALEX. The R06 SED is a factor of ~ 5 
more luminous than our highest L(,Ha-bin SED (see Fig. 11), which 
ensures that host contribution should be lower. In addition, R06 re- 
moved the host contribution, which they find is generally < 0.1 
dec, even in the J-band. Since the FUV of R06 is mostly derived 
from SDSS photometry of high z quasars, it shows a turnover at 
logv > 15.32, induced by the La-forest, which is not present in 
the Tl sample SEDs. We therefore extrapolate the R06 SED up to 
logv =15.4 (the highest relevant rest frame v) withLy °= v^'''^, as 
seen at lower v (see also figure 19 of Trammell et al. 2007). 

The R06 SED was scaled to match the NUV data points of 
the highest LbHa bin (L^Ha = lO^-''^ erg s^', left panel of Fig. 11). 
For each of the seven lower luminosity bins, we further rescaled 
the R06 SED by the mean LbHa in e.a£h bin, relative to LbHa in 
the highest bin. The scale applied to each bin is listed in Table 2. 
Note that the LbHa-rescaled R06 SEDs match well the FUV data 
points of the mean SEDs also in the second to fourth L^Ha bins, as 
expected from their linear relation with LbHa (FUV panel in Fig. 
10). 

We now subtracted the scaled R06 SED, i.e. the matched 
'pure' AGN SED, from the mean SED in each bin, and the residu- 
als are plotted in the middle panel of Fig. 1 1 . This residual includes 
both the host emission, and any change in the mean AGN SED 
with decreasing LbHa- The errors on the residual are high in the 
luminous bins and in the UV, since the residual is a small fraction 
of the total light. For clarity sake, error bars are not shown. The 
possible systematic errors in this method are discussed in §3.4. The 
residuals have similar overall SEDs, with some small differences 
at the NIR. They all appear to be similar to a typical galaxy SED, 
which peaks at log v = 14.4. The residuals imply that the mean host 
galaxy luminosity increases with the mean LbHa luminosity, which 
suggests a tight relation between the AGN luminosity and the host 
luminosity. However, as we show below, this apparently tight rela- 
tion is just a selection effect. 

To compare the residuals to the mean SED of typical galaxies, 
we formed a matched-z sample of 298 SDSS galaxies classified 
by B04 as inactive (excluding 'type 2 AGN' or 'composites', see 
§3.1.4). The galaxies were drawn from the GALEX Deep Imaging 
Survey region, to maximize UV detections, and thus minimize bias 
in terms of their UV emission. Their SEDs were formed in a man- 
ner identical to the Tl sample. For each LbHa -bin of the Tl sample 
we formed a z-matched sample of inactive galaxies in the follow- 
ing manner. First, the inactive galaxies are divided into eight z bins, 
evenly spaced in z''^, and the mean-SED of each z-bin is com- 
puted, forming 'z-specific' SEDs. Then, the z-distribution of each 
AGN LbHa-bin is measured. Finally, the 'z-specific' SEDs are con- 
volved with the different z-distribution functions. Note that only a 
fraction of the high luminosity (and high z) AGN are resolved (Ta- 
ble 2), while all inactive galaxies are resolved by definition (§2.1). 
Therefore, the high z AGN are subject to somewhat different selec- 
tion effects from inactive galaxies with the same z. 

The mean-SEDs of the z-matched inactive galaxies, are shown 
in the right panel of Figure 1 1 . The convolved detection fraction is 
< 70% at high z in the FUV, and at low z in the NIR, so the SEDs 
may be somewhat biased in these regions. Except in the lowest z 
bins, the residuals largely resemble the inactive galaxies, both in 
shape and in luminosity. The trend of increasing host luminosity 
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with the AGN luminosity, seen in the middle panel, just reflects 
the fact that higher luminosity AGN reside at higher z. Since most 
of the Tl objects are selected from the SDSS sample of extended 
objects (galaxies), and the SDSS sample is flux limited, the higher 
z AGN must reside in higher luminosity host galaxies. Now, for the 
unresolved AGN, which dominate at the highest LbHa (see Table 
2), the hosts are not forced to be more luminous to be detectable 
at higher z, and indeed the inactive galaxies are somewhat more 
luminous than the AGN hosts at the highest LbHa bins. Unresolved 
AGN flag the position of their host galaxies, galaxies which would 
have not necessarily passed the flux limit of the SDSS survey. 

The subtraction of the scaled R06 AGN SED leads to a resid- 
ual SED which is generally consistent with the mean SED of inac- 
tive galaxies in the middle and high LbHa bins, indicating the linear 
scaling of the R06 SED with LbHa is roughly correct. At the low- 
est LbHa bin the inactive galaxies show excess UV emission. This 
is unlikely to be an AGN SED effect, as the host dominates the 
emission at all bands (Fig. 11, left panel), and likely reflects the dif- 
ference in the mean AGN host galaxy (NEG) and the mean SDSS 
inactive galaxy at low z (SFG, see Fig. 9). There is also signifi- 
cant excess luminosity in the Kg and H-bands in the most luminous 
residuals. This could be due to a lower dust covering factor with 
increasing luminosity, as observed by Maiolino et al. (2007) and 
Treister et al. (2008), and thus weaker dust emission in the R06 
template compared to the Tl AGN. We do not attempt to quan- 
tify this relation or pursue it further, yet we note that the simple 
scaling law of a fixed AGN SED does not apply in the near IR. 
Note that the X-ray emission also does not scale linearly with LbHa 
either (Fig. 10 and §3.6). Therefore, we emphasize the applicabil- 
ity of this simple scaling law is limited only to AGN optical-UV 
emission, the region most likely dominated by the accretion disc 
emission. 



3.4 The AGN SED scaling with LbHa 

We express the net AGN vLv (= Lagn) by scaling the R06 SED 
by LbHa: 

LbHo 



Lagn(A) = (iR06(A) ± 10%) X (:^ 



2 erg s ' 



f (4) 



The normalization of LbHa^ iC^'^ erg s^', corresponds to LbHa = 
10 erg s^ of the highest luminosity bin, times the difference of 
lO"'^ in vLv(NUV) between this bin and the R06 SED, as described 
in the previous section. The scaling index is denoted by j3, where 
a j3 independent of A indicates that the mean SED shape remains 
fixed with LbHa- We use a 10% systematic error on the scaled R06 
luminosity to get an estimate of the implied errors in the residual 
fluxes by systematic errors of a given level. 

3.4.1 Constraining ji 

In §3.3, we found indications that j8 ~ I. The left panels in Figure 
12 provide additional support for this linear scaling law. It com- 
pares the implied host luminosity (Lbost = mean vLy — mean L^gn) 
at different LbHa bins, for different /3, to the mean luminosities of 
SDSS type 2 AGN with the same z distribution. The luminosity 
is presented in two optical bands: the SDSS z band (8932A) and 
the SDSS u band (3551 A). The z band is selected as it follows the 
stellar mass. The SDSS u band is a measure of the hotter, more 
massive stars, and thus a measure of the more recent star formation 
rate (SFR). Going further to the UV bands probes hotter and more 
massive stars, and provides a measure of the yet more recent SFR. 
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However, given the little residual emission left in the UV, the un- 
certainty can be large. For all objects, we use the u band and the z 
band photometric luminosities in the 3" SDSS fibre aperture, fixed 
by 0.35 mag, the mean offset for a PSF (Adelman-McCarthy et al. 
2008, see also §2.7). Magnitudes are converted to physical units 
using the SDSS-AB offsets in Abazajian et al. (2004). We use only 
the 2 766 objects from the Tl sample drawn from the SDSS sample 
of galaxies, which have the same selection criteria for spectroscopy 
as the type 2 AGN (§2.1). We use only type 2 AGN (classified by 
B04, see §3.1.4) that pass the S/N and bad pixel criteria applied to 
the Tl sample (§2.1). 

Fig. 12 presents L^ost derived for /3 = 0.9, 1.0 and 1.2. The 
three left panels present the mean Lj,ost(" band), Lj,ost(z band) and 
u — 2(host) colour for the Tl and for z matched type 2 AGN (the 
mean z for each Z-bHa bin is marked at the top of the figure). We also 
plot the relations for inactive galaxies here selected from the com- 
plete SDSS survey (not limited to the deep survey field of GALEX). 
The error bars on the jS = 1 line depict the effect of a 10% system- 
atic error in the scaling of the R06 SED (eq. 4), added to the er- 
ror in the mean. The possible systematic error has a significant ef- 
fect at the highest LbHce bins, where Lhost becomes a small fraction 
of the total luminosity, in particular in the it band. At the highest 
log LbHa = 43 bin, the scaled R06 becomes higher than the mean 
u band luminosity, leading to an implied negative Lhost' and only 
the upper limit from the systematic eiTor is shown. At lower LbHa 
bins the scaled R06 SED becomes much weaker than the observed 
mean SED (left column of Fig. 11), and the host SED nearly equals 
the observed mean SED. 

As Fig. 12 clearly shows, a j8 = 1 leads to a good match of the 
Tl host galaxies to the host galaxies of type 2 AGN, as expected 
based on AGN unification. Values of j3 < 0.9 and j8 > 1 .2 in the 
u band lead to highly discrepant hosts. The z band is more host 
dominated, and less sensitive to the scaled AGN subtraction, so it 
does not provide a strong constraint on j8 . It is remarkable that the 
simplest scaling, j3 = 1, i.e. that the net AGN SED maintains a 
fixed shape, which scales linearly with Lbea^ leads to the closest 
match of the implied type 1 host to the observed type 2 host. The 
small differences in overall luminosity between type 1 hosts and 
type 2 AGN at low LbHa (low z) cannot be due to a different /3, 
as the AGN light is too weak. They could either be a result of the 
different selections used to create the type 1 and type 2 samples, or 
some modification of the unified model at low luminosities. 



lation and by the R06 SED: 



/ 



3.4.2 The mean net AGN SED 

Combining the /3 = 1 deduced from the u and z bands (Fig. 12) with 
the linear relation of vLv(FUV) and Z-bHa (Fig. 10), we get that 
three bands in the 1500 A - 9000 A range scale linearly with LbHa> 
implying that the optical-UV net AGN SED shape is independent 
of luminosity. This result is consistent with the finding of Bentz et 
al. (2009) for a sample of 35 reverberation mapped AGN, where 
they found that the dependence of the BLR radius on AGN optical 
luminosities, corrected for the host emission, is the same as the 
dependence on UV luminosities (Kaspi et al. 2005), which implies 
that the net AGN optical luminosity scales linearly with their UV 
luminosity. 

Below we list the ratios of AGN continuum emission in dif- 
ferent rest-frame wavelengths to LbHa> as implied by the linear re- 
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The implied mean EW of the broad Ha is 570A. The disper- 
sion is derived from the scatter around the linear relation between 
logvLv(FUV) and logLbHa, in the four luminous bins (Fig. 10). 
We do not attempt to measure the dispersion in the LbHa vs. 
Lagn('^) relations at optical A, due to the unknown dispersion 
in the host contribution. However, the dispersion at optical X is 
smaller than in the UV (Fig. 10), despite the additional dispersion 
due to the host contribution. Therefore, for X > FUV, we treat the 
noted scatter of 2.4 as an upper limit on the scatter of Lagn(''-) vs. 

LbHa- 

Using the conversion of R06 from vLv at various bands to the 
bolometric luminosity (see their figure 12), Lboi, we get 



Lbol = 130^24 xLbHa 



(6) 



3.5 The AGN host 

3.5.1 Host mass 

As discussed above (§3.4, and Fig. 11), the apparent correlation of 
Lbost and Lbea^ is most likely just a selection effect, as the mean 
luminosity of inactive galaxies, of similar z distribution, rises simi- 
larly with z. This similarity strongly suggests that the apparent cor- 
relation between LbHa and the AGN Lhost is induced by the correla- 
tion of both quantities with z. To test if the Lbost versus LbHa corre- 
lation is significant, we use the Tl sample presented in Fig. 12 (se- 
lected from the galaxy catalogue), to perform a partial correlation 
analysis. The Tl Lhost(z band) is indeed strongly correlated with 
LbHa> with a Pearson correlation coefficient -P/- (Lbost, LbHa) = 0.66. 
However, both quantities are indeed strongly correlated with z, 
with P,-(Lhost,z) = 0.82, and P,(Lkna,z) = 0.7^1 and the par- 
tial correlation of Lbost (z band) versus LbHa> at a fixed z, is only 
L/(LhostiLbHa)lz = 0.14. Furthermore, a fit of the residuals in the 
Lhost(z band) - z relation versus the residuals in the LbHa - z rela- 
tion gives Lhost (z band) o= LJJj^^, which again demonstrates there is 
no significant dependence of Lbost (z band) on LbHa- 

Note that Lbost and Lagn are not entirely unrelated, as one ex- 
pects a lower limit to Lbost at a given Lagn, in order not to exceed 
LEdd- But the Tl subsample of extended objects excludes AGN 
close to LEdd (§3.1.2, Fig. 8). 

As discussed above (§3.1.4), the Tl Lbost distribution follows 
the NEG luminosity distribution (Fig. 9). We therefore break the 
group of inactive galaxies (Fig. 12, left panels) to its two subgroups 
of NEG and SFG (Fig. 12, right panels). At low LbHa, Lhost(z band) 
follow well NEG. This similarity is consistent with the tendency 
of low luminosity AGN to reside in early type galaxies (Ho et al. 
1997b). With increasing Lbost. the Tl hosts become less luminous 
in the z band than NEG, and at the highest luminosities the Tl hosts 
are intermediate between NEG and SFG. 



The values of P, are calculated using logL^Hajlog Lhost ^nd log z. 
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Figure 12. (Left panels) A compaiison of the implied host luminosity (total - scaled R06), for different scaling laws, with z-matched type 2 AGN, in the u and 
Z bands. The residuals vs. LbHa are shown as solid/dotted lines for linear/non-linear scaling laws. The scaling index, j8 (eq. 4), is noted near each line. EiTor 
bars are the sum of a 10% systematic error (eq. 4) and the error in the mean. The mean z is noted on top. Also shown are z-matched inactive galaxies. In the 
u band, /3 < 0.9 is ruled out, as the implied residual is negative for LbHa > 10 erg s" . It is remarkable that the simplest scaling, /3 = 1 in both bands, leads 
to the closest match of the mean type 1 host to the mean type 2 host. Together with the linear vLv(UV)-LbHa relationship at high luminosities (Fig. 10), this 
implies the mean optical-UV net AGN SED shape is fixed and scales lineai'ly with LbHa ■ The small offset of T 1 and type 2 at low LtHa cannot be explained by 
/3 ^ 1 as the AGN emission is a small fraction of the total fight. (Right panels) A comparison of tfie residual implied from the /5 = 1 scaling law, with inactive 
galaxies (left panel) now broken to NEG and SFG. At low LbHa, hosts of Tl AGN resemble NEG, both in mass (z band) and in specific SFR (i.e. colour), 
consistent with the tendency of low luminosity AGN to reside in elliptical hosts. With increasing LbHa> the Tl hosts become less luminous in the z band and 
bluer than the average NEG. The implied SFR is noted in Table 3 and §3.5. 



3.5.2 Host color 

The u — z colour of the Tl host follows well NEG at the lowest 
^bHa- With increasing LbHa ths Tl hosts become bluer, and are 
intermediate between SFG and NEG at the highest L\^\ia- Kauff- 
mann et al. (2003a) find that low luminosity type 2 AGN have NEG 
colours, while high luminosity type 2 AGN tend to have colours in- 
termediate between NEG and SF galaxies. The results above indi- 
cate that type 1 AGN have the same hosts as type 2 AGN. Note that 
our selection of type 1 AGN is independent of the distinction be- 
tween SFG and NEG, which is based on narrow lines. In contrast, a 
type 2 AGN sample, selected based on narrow line ratios, excludes 
a SF-dominated host galaxy. Although, as shown by Kauffmann et 
al. (2003a, fig. 6 there), only the lowest luminosity AGN can be 
dominated by the host SF line emission. 

In Table 3, we derive the SFR and specific SFR for each LbHa 
bin, using the rest-frame u band luminosity and u — r colour of the 
mean hosts. We calculate the rest frame u band and r band lumi- 
nosities using a power law interpolation between adjacent observed 
bands. The third column lists bT,QQ{u — r) (Blanton & Roweis 2007), 
the ratio of stars formed in the last 300 million years to the total star 
formation history of the galaxy, based on SED modeling of SDSS 
galaxies using the stellar models of Bruzual & Chariot (2003). The 
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Table 3. The dependence of mean aperture SFR (within the SDSS fibre) 
on AGN luminosity. Col. 3 lists the mean ratio of stars formed in the last 
3 X 10** years compared to the total star formation of the galaxy, derived 
from rest-frame u — r colour (Blanton & Roweis 2007). Col. 4 lists the SFR 
derived from u band luminosity (Moustakas et al. 2006). Both measures are 
based on the mean Tl AGN, after subtracting the scaled R06 SED (Fig. 
11). EiTorsare the sum of a 10% systematic error (eq. 4) and the error in the 



fourth column lists the cuiTent SFR, based on the correlation of the 
u band fibre luminosity with Lna in SDSS SFG (Moustakas et al. 
2006). These estimates are subject to a host of systematic effects, 
such as reddening, the time dependence of the star formation, and 
obviously the estimated host luminosity. 
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Figure 13. The mean host to AGN luminosity ratio as a function of AGN 
bolometric luminosity at different rest-frame bands, based on the decompo- 
sition in Fig. 1 1 . The small gray circles mark the transition luminosity from 
host dominance to AGN dominance. In the J band the host dominates up 
to Lboi = 10**'' erg s"', well into the quasar regime, while in the FUV the 
AGN dominates already at Lboi > 10 erg s" . The ratio at A = SIOOA is 
consistent with the Shen et al. (201 1) result (dashed line) which is based on 
the QCV sample. Note that the results are relevant for the specific apertures 
used by GALEX, SDSS and 2MASS. 



3.5.3 The mean host/AGN luminosity ratio 

The results described above provide a practical measure of the av- 
erage host/AGN luminosity ratio in various bands, as a function of 
the AGN luminosity for SDSS, GALEX, and 2MASS. Figure 13 
plots the host luminosity (i.e. residual luminosity) to the net AGN 
luminosity (i.e. the scaled R06 SED emission), as a function of L|,ol 
(eq. 6) for different rest-frame bands. In the J band the host domi- 
nates, on average, up to logL(,oi = 45.5, well into the quasar-scale 
emission, including the highest LbHa-bin in our analysis. At 5 lOOA 
the host dominates at logLboi ^ 44.7, consistent with the relation 
obtained by Shen et al. (201 1) for the SDSS quasar sample. As ex- 
pected, in the FUV the AGN becomes more dominant already at 
logLboi > 42.5. The plotted ratios are strongly dependent on the 
apertures used in each band. Also, the dispersion of Lhost/^AGN in 
each LbHa bin is significant, so individual objects may differ sig- 
nificantly. The plotted ratio only serves as a practical guide for the 
AGN catalogues of SDSS, 2MASS and GALEX samples used here. 



3.6 Unobscured type 2 AGN based on the Lx versus LbHa 
relation 

Unobscured type 2 AGN, or the so-called 'true type 2' AGN, are 
AGN in which the BLR is not observed because it does not ex- 
ist, as opposed to standard type 2 AGN in which the BLR is ob- 
scured. Candidates are often found by looking for low luminosity 
AGN, detected through a nuclear X-ray point source, that do not 
show a sign of a BLR in their optical continuum, but their X-ray 
spectrum does not indicate obscuration (Tran et al. 201 1, Shi et al. 
2010, Rigby et al. 2006, Trump et al. 2009). Below we describe 
how the Lx(= vLy(2 keV)) versus LbHce relation can be used to 
test whether the lack of detection of broad lines is indeed signifi- 
cant, and justifies the true type 2 identification. 

Our X-ray band detection rate for the complete Tl sample is 
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Figure 14. The Lx vs. LbHa relation as a probe for true type 2 AGN. The 
plot includes only Tl objects with F(,Ha > 10"'^^ erg s"' cm^^, where 
> 50% have ROSAT detections (Fig. 3). The non detections are marked 
by triangles, which show the mean upper limits, averaged in half decade 
wide LbHa bins. The best power law fit for the X-ray detections is shown 
as a black line, with gray fines indicating the dispersion. The left pointing 
arrows mark five true type 2 candidates from Tran et al. (201 1) and Shi et al. 
(2010). These objects have an apparently unobscured X-ray emission, but 
no broad Ha detection. Only lES 1927+654 and IRAS 01428-0404 have 
upper fimits on LbHa which are significantly lower than expected based on 
their Lx, and are therefore potentially 'true type T AGN. In NGC 4594 and 
NGC 4698 the expected LbHa, based on their Lx, is a factor of ~ 100 below 
the upper limit on LbHa. and in NGC 3147 50% of the objects are expected 
to be below the LbHa upper limit. The upper limit on LbHa depends on the 
expected Av, which can be derived from the Mbh and L/LEdd estimates for 
each object. 



43%. The detection rate depends on LbHa^ and becomes as low as 
20% in the lowest LbHa bins (Table 2). Therefore, in the following 
analysis of the Lx versus -LbHa relation we utilize only objects with 
FbHa > 10^'^'^ erg s^' cm~-^, which have an X-ray detection rate 
of 61% (see Fig. 3). Figure 14 presents the Lx versus LbHa relation. 



The Lx of inactive galaxies is generally < 10 erg s^ (Bauer et 
al. 2004), so host contribution should not significantly affect our 
analysis. 

A least squares minimization yields 

logLx,42 = (0.79 ±0.02) logLbHa.42 +(0.45±0.01) (7) 



ttox = (-0.09 ±0.01) X log/25ooA.29 - 1-42 



(8) 



where Oox = 0.38 x log(/2 kev/'2500A)' ^bHa and Lx are in units of 
10 erg s^' and /25()oa '"^ 1" ^'^S ^^ Hz^^ Equation 8 is similar 
to eq. 2 in Steffen et al. (2006), where 0!ox °= i°8SsnnA 
The fit relation (eq. 7) is depicted in Fig. 14. For each LbHa -bin, a 
triangle marks the mean upper limit on Lx of the T 1 objects without 
ROSAT detections. 

Eq. 7 provides a useful relation for the search for the putative 
unobscured type 2 AGN. Below we compare the upper limit on 
^bHa for a number of potentially 'true type 2' AGN presented by 
Tran et al. (2011) and Shi et al. (2010) with their predicted LbHa 
based on their observed Lx. 
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For each object we set an upper limit on the broad Ha flux 
as 10% of the continuum flux density near Ha, multiplied by the 
expected Av. The expected Av is estimated from Mbh and L/LEdd> 
by applying eqs. 2 and 3. We use the three objects in Tran et al. 
(2011) where estimates of Mbh and L/L^^^ are provided, and also 
two objects from Shi et al. (2010), NGC 4594 and IRAS 01428- 
0404, where optical spectra and Mbh estimates are provided. These 
five objects have a measured Lx, and we use eq. 7 to derive the 
expected LbHa based on their observed Lx, and compare it to the 
upper limit on LbHa derived above. The five objects are plotted in 
Fig. 14 (left-pointing arrows). Three of the potentially true type 
2 candidates, NGC 4594, NGC 4698, and NGC 3147 have upper 
limits on L[,Ha that can accommodate the expected values of L(,Ha 
based on the extension of eq. 7 to lower luminosities. Thus, the 
absence of a broad Ha in these objects is not significant. A high 
angular resolution spectrum is required to exclude the strong host 
contribution near 7000A, and determine if the expected weak Ha 
is indeed missing. In the other two objects, IRAS 01428—0404 and 
lES 1927-1-654, the expected L^Ha is well above the upper limits. 
The deficiency in LbHa is significant at the levels of 4<T and la, 
respectivel}Q and these two objects appear to be true type 2 AGN. 

An estimate of the expected LbHa can also be derived directly 
from Lboi, using eq. 6. However, the estimated Lboi in Tran et al. 
and Shi et al. is based on Ljomj, and its relation with AGN lu- 
minosity has significant dispersion (Baskin & Laor 2005, Caccian- 
iga & Severgnini 2011). This large dispersion is less inhibiting in 

the estimation of Av, used to derive the upper limit on LbHa- since 

1/4 

Av oc Lj^^j when m is known (eq. 2). An analysis of the relation 

between L[o mj and LbHa in the Tl sample, and its implication on 
the identification of true type 2 AGN, will be performed in a fol- 
lowing paper. 



3.7 The SED dispersion 

Above we have shown that the mean SED of broad line AGN is 
well reproduced by the sum of the mean SED of luminous quasars, 
scaled down by LbHa^ plus the mean SED of inactive galaxies 
(NEG-like at low LbHa^ and bluer with increasing LbHa)- However, 
what is the dispersion in the shapes of individual SEDs? Below we 
provide some measures of the distribution of the SEDs, and discuss 
its possible origin. 



3. 7. 1 The distribution of vLy (FUV) /LbHa <^''^d aopt 

Figure 15, left panel, presents the distribution of vLv(FUV)/LbHa 
in four decade-wide bins of LbHa (coloured solid lines, each bin in 
Fig. 15 is composed of two consecutive LbHa bins from Table 2). 
The distributions for LbHa =? lO'*''* erg s^' show a narrow peak at 
vLv(FUV)/LbHa = 45 — 75, compared to the mean value of 33 (eq. 
5). There is an extended tail towards lower values, which is simi- 
lar in all three bins. The amplitude of the narrow peak drops with 
decreasing mean LbHa- In the lowest bin, LbHa = 10 erg s^', 
the peak becomes broader and is shifted to higher vLv(FUV)/LbHa 
values. As found above (Fig. 10 FUV panel), this bin is affected 
by significant host contribution to the FUV band, which produces 



Our calculation of the dispersion assumes LbHa is the independent vari- 
able, since the Tl sample is selected by the broad Ha. When estimating 
LbHa from Lx one should use an X-ray selected sample and treat Lx as the 
independent variable. 
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Figure 15. (Left panel) The distribution of the vLv(FUV)/LbHa values as 
a function of LbHa ■ Four LbHa bins, a decade wide, are plotted (solid lines). 
The mean LbHa is noted near each distribution. The upper half of the upper 
bin, with 43.3 < logLbua < 43.8, is also shown (dashed Une). The dis- 
tributions for log LbHa > 40.5 have a very similar shape. All distributions 
ai'e asymmetric, with an extended tail towards low vLv(FUV)/LbHa val- 
ues, which may be produced by reddening. In the logLbHa = 40.5 bin, host 
contribution of the UV broadens the distribution and increases the mean 
value. (Right panel) The distribution of optical slope, also binned by Lbua- 
The host emission makes ttop, redder with decreasing AGN luminosity. The 
upper half of the highest luminosity bin is least affected by the host con- 
tribution, and is well fit by a Gaussian (dotted line) and an extended red 
tail, which may also be produced by reddening. There is therefore a small 
dispersion in vLv(FUV)/LbHa and ttopt in the net AGN emission, based on 
the objects least affected by the host contribution. 



the offset to higher vLv(FUV)/LbHa values, and a larger disper- 
sion. The higher LbHa bins are not affected by the host contribu- 
tions, and the observed distribution of vLv(FUV)/LbHa values is 
an intrinsic AGN property. The distribution of these three bins is 
similar, despite being selected differently by the SDSS, where the 
logLbHa = 43 objects are mainly point sources, selected by their 
non-stellar colours, while lower LbHa objects are selected to have 
extended morphologies (§2.1 and Table 2). Since Ha is generally 
powered by recombination, it provides a measure of the ionizing 
photons intercepted by the BLR. Thus, the narrow peak suggests 
there is a small dispersion in Lionizing/vLv(FUV), and a small dis- 
persion in the covering factor of the BLR. The tail towards lower 
vLv(FUV)/LbHa values may be due to reddening, as further dis- 
cussed below based on additional indicators. 

Figure 15, right panel, shows the distribution of the SDSS 
based optical slope aopt = 4 x log(/v(3940A)//v(7000A)) as a 
function of LbHa- The aopt distribution moves to redder values with 
decreasing LbHa^ reflecting the increasing amount of host contri- 
bution. The highest LbHa bin (43.1) is least affected by the host 
contribution, and to further decrease the effect of the host contribu- 
tion we show the upper half of the log LbHa =43.1 bin, where 
log Lbiia = 43.5 (43.3 < log LbHa < 43.8). The distribution is 
well characterized as a Gaussian (see plot) with a red tail excess. 
This distribution is similar to the slope distribution found in lumi- 
nous quasars by Richards et al. (2003), measured between 1450A 



20 Jonathan Stern andAri Laor 



and 4040A of composite spectra. Their slope distribution peaks be- 
tween -0.54 and -0.41 (table 1 there), while 6% of their objects are 
in a red tail, attributed to a reddening effect. Here, the peak of the 
distribution of the LbHa ~ 10 erg s^ objects is at Oopt ~ —0.39, 
and 12% of the objects are in the red tail with Ofopt < — 1 ■ This sim- 
ilarity with Richards et al. (2003) indicates that the AGN slope dis- 
tribution does not change significantly with luminosity, and that the 
host effect on the scatter in Oopt is negligible at logLbHa ^43.3. 
Both the weaker FUV emission and the steeper optical slope may 
indicate dust reddening, and their relation is explored below. Thus, 
we conclude that both the mean net AGN SED shape is independent 
of luminosity, and it also shows a small dispersion, as indicated by 
the distribution of vLv(FUV)/LbHa and ttopt in the AGN dominated 
luminosity range. 



3.7.2 The dependence of the SED on Ofopt 

How does the overall SED change with OCopt? Figure 16 shows the 
mean SED at the eight different LbHa bins presented above (Fig. 
11), further divided into 0.5-wide Oopt bins. To minimize statis- 
tical noise, we present only bins with ^ 10 objects. The inset in 
each panel shows a zoom of the spectrum around 4000A, where the 
AGN emission lines of H5, He, [Ne III], and the stellar absorption 
features of the Ca II H and K lines, can be seen. The stellar ab- 
sorption features provide information on the host stellar population 
(e.g. Wild et al. 2007). Table 4 lists for each bin, the mean-LbHa> 
mean-ttopt, number of objects, and detection fractions in the X-ray 
(dx) and UV (liuv)- Note that at lower luminosities the mean X-ray 
detection fraction becomes well below 50% (Table 4), so the mean 
Lx becomes significantly biased. Furthermore, at logLx < 42 the 
host contribution may become significant (e.g. Bauer et al. 2004). 

In the highest logLbna = 43.5 bin, the Oopt distribution is nar- 
row, as seen in Fig. 15, and there are only three Oopt bins of SEDs. 
The difference between the SEDs is small in the IR, increases to the 
UV, and disappears again in the X-ray. This pattern is highly sug- 
gestive of a reddening effect, as the peak extinction of dust is in the 
UV, and it drops both towards the NIR and towards the X-ray. Also, 
as shown below, the Ha EW is independent of Oiopt- If the observed 
SEDs were intrinsic, the large differences in the FUV emission, of 
more than a factor of > 5, should have produced similar ampli- 
tude in the mean Ha EW, while the observed difference is < 10%. 
These arguments support the suggestion that the red tails discussed 
above (Fig. 15) are due to dust extinction. It also hints that some of 
the dispersion in the narrow Gaussian peak of the aopt distribution 
(O!opt = —0.3 to —0.7) is due to reddening. The reddening scenario 
is further explored in §3.7.3. 

At logLbHa ~ 43.0, there is a larger range of aopt, which is 
also consistent with a larger range of dust extinctions. However, 
there is a small reversal in the NIR, where the steeper aopt ob- 
jects, which are weaker in the UV, become stronger in the NIR. 
The different aopt SEDs cross at A ~ 1/xm. One may suspect that 
the reverse NIR effect results from a larger covering factor of dust, 
and thus stronger dust emission, in more reddened objects. How- 
ever, the difference is most prominent at log v ~ 14.5, where dust 
emission is negligible. The emission of galaxies generally peaks 
at logv ~ 14.5, indicating increasing host contribution as aopt get 
steeper. The lower LbHa bins show a stronger effect, as the NIR be- 
comes significantly more prominent, and the SEDs crossing point 
shifts to shorter A, going down to A ~ 6OOOA at log LbHa ~ 42, and 
A ~ 5000 A at log LbHa ~ 41. 

As the inset shows, with decreasing aopt at a given LbHa> the 
stellar Ca II H and K absorption lines become stronger. The effect 



becomes more prominent at the lower LbHa bins. Thus, apart from 
the highest LbHa bin, the change in the SED with Oopt also reflects 
an increasing host/ AGN luminosity ratio, as also indicated by the 
increasing excess emission at logv ~ 14.5. For logLbHa S; 42, a 
steeper aopt at a given LbHa> is mostly produced by an increased 
host contribution, rather than by increasing dust extinction. This is 
indicated by the shift of the SED crossing point, i.e. spectra have a 
redder slope due to increasing NIR emission, rather than decreasing 
UV emission. 

What makes some AGN be more host dominated than others, 
within the same LbHa (i-e. AGN luminosity) bin? A higher host lu- 
minosity near 1 jiva implies a higher host mass, and if the host is 
bulge dominated, also a higher bulge mass. A higher bulge mass 
implies a higher Mbh> and thus a lower L/Le^jj. Thus, the range of 
host contributions reflects the range of L/LEdd at a given AGN lu- 
minosity. This is further explored below using the relation between 
Av and aopt, and in §3.9 by exploring directly the SED as a function 
ofL/LEddandMBH- 

An association of increasing reddening with increasing 
host/AGN ratio is clearly seen for the logLbHa ~ 42,42.5 AGN 
(Fig. 16). This suggests that the increasing reddening is associ- 
ated with dust in the host galaxy. An alternative explanation for 
the range of reddening is dust local to the AGN, where an excess 
of dust along the line of sight implies a close to 'edge on' view, i.e. 
an 'almost' type 2 AGN. Such a local dust effect is not expected 
to affect host/AGN luminosity ratio near 1 /im, in contrast with the 
observations. We note that we cannot exclude an alternative option 
where the steeper Ofopt bins are composed of two separate popula- 
tions, host dominated AGN, and reddened AGN, in particular if the 
mean broad Ha equivalent width (EWbHa) tends to be higher for 
host dominated AGN. 

The various notions raised above are shown quantitatively in 
Table 4. The mean EWbHa of each bin is tabulated in the sev- 
enth column. In the most luminous bin, the EWbHa are simi- 
lar to within ^^ 10%. This is consistent with the reddening sce- 
nario, as dust, which must reside outside the BLR, is unlikely to 
change the EWbHa- From the logLbHa ^ 43 bin and downwards, 
there is a general decrease of EWbHa with aopt, most likely due 
to the increase in host contribution. In the eighth column we list 
Lhost/LAGN(J), where Lagn(J)= 8.5 LbHa (eq- 5), and Lho.st(J)= 
vLv(J)— Lagn(J)- This column gives quantitatively the dependence 
of aopt on the relative amount of host contribution. As can be seen, 
the relative host contribution is a significant factor in the deter- 
mination of the slope already at the second luminous bin, as con- 
cluded above based on the SED. However, the small reduction of 
< 10% in the EWbHa with steepening aopt, for logLbHa ^ 42.5, 
indicates that the steepening is not due to increasing host contribu- 
tion, but due the increased reddening. At logLbHa ^ 42 the signif- 
icant drop in EWbHa with steepening aopt indicates the steepening 
is due to increasing host contribution, and not due to reddening. In 
column 9, the 4000A-break is tabulated, derived from the ratio of 
flux at 4000A < A < 4100A to the flux at 3850A < A < 3950A 
(D4000; Balogh et al. 1999). The net AGN emission, and the asso- 
ciated 'small blue bump', produce D4000 < 1> while stars produce 
D4000 > 1 which increases with decreasing surface temperature. 
The value of £'4000 increases with decreasing LbHa^ and with de- 
creasing aopt at a given LbHa' reflecting the diminishing AGN con- 
tribution compared to old stars in the host. However, in the three 
lowest luminosity bins, this trend saturates for aopt < —2.5, and 
D4000 depends only on aopt, and is independent of LbHa (see also 
Fig. 16). The dependence of D4oo() on aopt at these luminosities 
probably reflects the host type, where a bluer host galaxy (flatter 
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Figure 16. The mean IR to X-ray observed SEDs and optical spectra near 4000A as a function of LbHa and Olopt- Tlie mean LbHa is noted in each panel, 
and the mean Oopt is noted near each curve. Only bins with ^ 10 objects are shown. The value of Lx is shifted in the y-axis (+1 dec) for presentation. 
The error bars denote the error in the mean Lx and vLv(FtrV). Data from different surveys are connected by thinner lines. The insets show a zoom of the 
optical spectrum around 4000A, showing the AGN emission of H5 A4102, He A3970, and [Ne III] A3869, and Ca II H,K AA3934,3968 stellar absorption 
features. In the highest L|,Ha panel the difference between the SEDs increases from the NIR to the UV and disappeai's in the X-ray, consistent with a reddening 
effect. At 42 ^ logLbHa ^ 43, lower Olopi SEDs also show a stronger bump at log V = 14.4, and stronger stellar absorption features, indicating a stronger 
host contribution, in addition to reddening. At LbHa ^41, the host dominates the emission at all bands (see Figs. 10, 13), and the increase of the equivalent 
widths of the stellar absorption features with decreasing O!opt reflects the change in the host stellar population. Note that dust extinction appears as missing UV 
emission for a steeper ttopt, while host contribution appears as excess NIR emission for a steeper Oopt. The host contribution progresses to shorter wavelengths 
with decreasing LbHa ■ 
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Table 4. Mean quantities of the bins of LbHa and Olopi pictured in Fig. 16. The LbHa ranges are as in Fig. 1 1 and Table 2. The Ua^i bins have a width of 0.5. 
Only bins with A^ ^ 10 are listed. Col. 6 shows the mean and the error in the mean EWbHa- The Lhost/^AGN ratio (Col. 7), is calculated as in Fig. 13. The 
formulation of the 4000A-break (Col. 8) is given in Balogh et al. (1999). 



Wopt) is associated with hotter stars, and thus lower ^4000 values. 
Finally, the last column shows the average Av of each bin. The Av 
follows Oopt at all LbHa^ similar to Z)4ooo and L[,ost/iAGN(J)- This 
trend reflects the Mbh versus bulge mass relation. As discussed 
above, a steeper Oopt is associated with a more massive bulge, at 
a given AGN luminosity (given LbHa bin)- A higher bulge mass 
implies a higher Mge, and therefore necessarily requires a higher 
Av. Since Mbh °= '^v^ (eq. 2), at a fixed AGN luminosity, we ex- 
pect a steeper than linear rise in Lhost/^AGNC-T) with Av, as Table 4 
indicates. Also, at the three lowest L|,Ha bins, which are dominated 
by the host emission, a bluer host colour is associated with a lower 
Av, indicating a lower bulge mass, and thus a later type host galaxy. 
This result is consistent with the tendency of bluer galaxies to be of 
later type. 



3.7.3 The ejfect of dust extinction on the SED 



Above we have shown that the effect of the host on the optical slope 
is small at LbHa ~ 10 erg s^ , and that the dispersion in optical 
slope at this luminosity seems to originate from a reddening effect. 
In this subsection we evaluate the implied general properties of the 
extinction law. 

The upper panel of Figure 17 compares vLv(UV)/LbHa with 
Ofopt in the Tl objects from the LbHa = 43.5 bin. The vLv(UV) 
value is the geometrical mean of the NUV and FUV luminosities. 
There is a strong correlation of Oopt versus vLv(UV)/LbHa (Pr = 
0.78, significance of llcJ), with a best-fitting least squares slope 
of 0.93 ± 0.06. This correlation strongly suggests reddening is a 
significant cause of dispersion in Ofopt. already in the — 1 < cCopt < 
range (LV = 0.5, significance of 6cT for objects with oiopt > — 1)- 

Over plotted on the data are the expected relations for dif- 
ferent extinction curves, calculated using the formulations in Pel 
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(1992) for MW, SMC and LMC dust, and also the 'grey dust' 
formulation in Gaskell & Benker (2007). We assume the intrin- 
sic unabsorbed mean values of ttopt = —0.3 which extends to the 
UV, and vLv(UV)/L|,Ha ~ 47. We apply the appropriate extinc- 
tion law , and then convolve the extincted power law with the re- 
sponse functions of the GALEX filters at z = 0.24 (the mean z of 
the log LbHa = 43.5 bin, Table 2). The SMC or LMC extinction 
laws are preferred over MW and grey dust. 

An SMC extinction correction of Aofopt ~ 0.4 translates to 
E{B -V)= 0.1. Therefore, the Oopt = -0.7 bin (30% of the ob- 
jects. Table 4) and Ofopt = — 1.1 bin (9% of the objects) have an 
ii (S — V) of 0. 1 and 0.2 larger than the a^pt = —0.3 bin. The mean 
E{B — V) of the Ofopt = —0.3 bin, which includes the majority of 
the objects, is hard to assess, as there may be some intrinsic dis- 
persion in otopt (the best fitting slope in Fig. 17 yields a resid- 
ual scatter of 0.29 in Oopt). If the 10 percentile of flattest ttopt of 
all logLbHa = 43.5 Tl objects, —0.13, is the intrinsic unreddened 
slope, then the mean E{B — V) of the ttopt = —0.3 bin is ~ 0.04. 

In the lower panel we compare the UV slope auv (= 5.8 x 
log(/v;Fuv/'v;NUv)) with vLv(UV)/LbHa for the Tl objects with 
LbHa > 10 erg s^ , where the host emission in the NUV is neg- 
ligible (note auv and vLv(UV) are independent quantities). The 
general trend is a decreasing Ofuv with decreasing vLv(UV)/LbHa, 
as expected from reddening, but the scatter is significantly larger 
than the relation above with aopt. The mean trend seems to favor 
the MW or the grey dust curve. Note that unlike oiopt, which is 
based on the continuum derived from the SDSS spectroscopy, auv 
and vLy (UV) are derived from broad band photometry, which in- 
cludes the UV continuum and emission features. The dispersion in 
emission line properties likely induces the large scatter observed. 
We also note that the extinction law lines are derived based on the 
mean z of 0.18, while the apparent amount of reddening changes 
with z as different rest-frame frequencies enter the filters. This is 
especially pronounced in the MW extinction curve, as the 2175A 
feature enters and leaves the two UV filters. UV spectroscopy is 
required to derive more accurate continuum slopes in the UV, and 
to test whether the extinctions laws implied by the UV and by the 
optical are indeed inconsistent. 



3.8 The dependence of the SED on LbHa and Av 

In the following section we briefly explore the dependence of the 
mean SED on L/Lejjj and Mgn- Since both parameters are derived 
from the two directly observed quantities, LbHa and Av, we first 
explore the dependence of the mean SED on these two quantities. 

Figure 18 presents the mean optical spectra as a function of 
LbHa and Av. Each panel is for one decade wide LbHa bins, and the 
plots in each panel are for log Av bins a quarter decade wide. Var- 
ious prominent emission lines and Fe 11 multiplets are noted in the 
plot (Vanden Berk et al. 2001; Phillips 1978). The inset in the lower 
two panels zooms on the spectrum near 4000 A (see Fig. 16). The 
highest luminosity AGN at log LbHa =43.1 (upper panel) show 
clearly the strong dependence of the Fe II emission on Av, which 
is part of the Boroson & Green (1992) eigenvector 1 (EVl) trends. 
The logAv = 3.2 spectrum shows that Fe II multiplets are present 
from 7000A to below 4200A. The Balmer lines become asymmet- 
ric, with increasing Av, as expected from the EVl correlations. The 
EW of the [O I] A6300, [O II] A3727, and [Ne III] 13869 also in- 
crease with increasing Av. Stellar absorption features such as Ca II 
H,K AA3934, 3968, G4300, Mgb 15175, and NaD 15893 become 
clear at the highest log Av = 4.1 plot, and some can be traced back 
to lower Av values. 
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Figure 17. The observed relation of Oopt and auv with vLv(UV)/LbHa- 
Over plotted are the expected relations for different extinction laws of an in- 
trinsic power law emission with Oopt = 0<uv = —0.3 and a vLv(UV)/LbHa = 
47. Small dashes '-' markjumpsof 0.1 in £(B — y). The minimum log LbHa 
of the Tl objects used, to avoid host contribution, are noted in each panel. 
The data points are fit by linear relations (dashed lines). Upper panel The 
two independent ratios are highly correlated, and follow an SMC/LMC 
extinction law. This correlation suggests that reddening affects the disper- 
sion in ttopt also for Olopi > — 1 (the narrow Gaussian peak of the cHopt and 
vLv (UV) /LbHa distributions. Fig. 15). Lower panel There is a general de- 
crease of auv with vLv(UV)/LbHai consistent with reddening, though with 
substantial scatter. The mean trend prefers the MW / Gaskell & Benker 
(2007) extinction curves, although photometry-based slopes are signifi- 
cantly uncertain. 



The logLbHa = 42.3 bin shows a more prominent host con- 
tribution with increasing Av, as indicated by the increasing stel- 
lar absorption features and by the steepening aopt. These eifects 
are more prominent than in the upper panel. However the increase 
in the narrow line EW with increasing Av is not as strong as 
in the logLbHa = 43.1 bin. The lowest two panels show spec- 
tra which are dominated by the host continuum. This is demon- 
strated by the fact that the mean continuum luminosity density in 
the logLbHa = 41.4 bin is typically only a factor of two lower than 
in the logLbHa = 42.3 panel, although the expected net AGN con- 
tribution, based on LbHa, drops by an order of magnitude. 



3.9 The dependence of the SED on L/Ledd and Mbh 

Figure 19, presents the SED binned by L/L^dd, for sach panel, 
and by Mbh, within each panel. The bins are a decade wide, with 
L/L^dd in the range — 3 < / < and Mbh in the range 6 < m < 9. 
With decreasing L/L^dd ths host contribution becomes more dom- 
inant, as expected since L/L^dd provides a measure of L/Lbuige- 
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Figure 18. The mean optical spectra as a function of LbHa and Av (listed in each panel). To increase clarity, some mean spectra were shifted, as noted in the 
legends. Some prominent emission lines and Fe II multiplets are marked. The inset zooms on the spectrum near 4000A, as in Fig. 16. The highest LbHa bin 
shows the increasing EVl emission (BG92) with decreasing Av. The BG92 analysis was near Hj3 and this plot shows the extension of the EVl features from 
7000A to below 4000A. Note the change in the Fe II multiplets near 4250A and 6250A, and in the various forbidden Unes at A > 5500A and X < 4500A. At 
logLbHa = 42.3 there is an increase in stellar absorption features with increasing Av, in addition to the EVl trend. Higher Av implies a higher Mbh and thus 
a more massive and more luminous bulge (up to the SDSS aperture effect) for the same mean AGN luminosity. At logLbHa ^ 41.4, the host dominates the 
continuum, and at logLbHa = 40.5 the broad H/3 becomes undetectable due to the host contribution. Such objects will be defined as net type 2 AGN based on 
H/3 only. 



However, we also see a trend where the host contribution becomes 
more dominant (the NIR region of the SED turning from a dip into 
a peak) with decreasing Mbh at a given L/L^^^, in particular for 
/ Jj — 1.4 (the two upper panels). This likely arises from the fact 
that Lhost/^bulge rises with decreasing Lbulgo as expected in a flux 



limited sample (SDSS), where the low luminosity bulges are more 
commonly found in more luminous disc galaxies, rather than in 
lower luminosity bulge-only galaxies (i.e. dwarf ellipticals), which 
will be below the detection limit. Thus, Fig. 19 indicates that at a 
fixed I, the host tends to become disc dominated with decreasing m. 
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Figure 19. The mean observed SED as a function of L/Lem and Mbh- 
The bins are a decade wide in both m and /, in the ranges — 3 < / < and 
6 < m < 9. Each panel is for a given / bin (mean values in the upper left 
comer), subdivided by m (mean values near each curve). The values of Lx 
are shifted in the y-axis (+1 dec) for presentation. The error bars denote the 
error in the mean Lx and vLv(FUV). Note the increase in the relative host 
contribution (the bump near log V = 14.4) with decreasing / (top to bottom 
panel), as expected since Lagn/^bh corresponds to LAGN/^bulee and the 
Tl sample is bulge dominated (§3.5). Note also the increase in the relative 
host contribution with decreasing m, at a given /, in the middle and upper 
panels. This likely reflects the increase in Lhost/^bulge with decreasing m, as 
smaller bulges are often found in disc galaxies. At / < —2 (lowest panel), 
the optical-NUV SED does not change with Mbh, as the host dominates 
for all Mbh- The fraction of X-ray detections is > 50% only in the four top 
curves, so other plotted Lx values can be significantly biased. 
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In the lowest L/L^uige bin (/ = —2.3) the host dominates at all m, 
although some AGN contribution can be noticed in the FUV and 
X-ray, in particular at m = 8.3, which corresponds to the highest 
AGN luminosity. 

The SED trends seen in Fig. 19 are dominated by the host con- 
tribution. To explore trends in the net AGN SED with L/Lejjj and 
Mbh, we use the emission bands least affected by the host contri- 
bution, which are the FUV and X-ray bands (e.g. Figs. 10, 13). Fol- 
lowing the discussions in §3.3 and §3.6, in order to avoid host con- 
tribution in these bands, we utilize only objects with logLbHa > 42 
when analyzing the FUV emission, and objects with logL^Ha > 41 
when analyzing X-ray emission. Also, to reduce detection biases 
when analyzing X-ray emission, we use only objects with FbHa > 
10^ erg s^' cm^ (Fig. 3, §3.6), where the X-ray detection rate 
is > 50%. 

Below we explore which of the parameters, LbHcc ^v, Mbh 
and L/Le(J(J, appears to be the dominant parameter which drives 
the values of three SED parameters, Lx/i-bHa, vLv(FUV)/LbHa, 
and their ratio apuv.x = 0.42 x log /v(2keV)/lv(FUV). Since 
^bHa "^ ^boi (eq- 6), the ratios of Lx and of vLv(FUV) to L|,Ha 
is a measure of the fraction of the bolometric luminosity emitted 
at each band. The value of apuv.x is closely related to the more 
commonly used 2500A-2 keV slope (oJox), which in eq. 8 was cal- 
culated by deriving the 2500A luminosity from L^Ha (eq. 5). 

The binning into L/LEdd and Mbh is performed in the follow- 
ing manner. The objects are sorted by either m or by / and divided 
into four equal size groups. Each of these groups is then sorted by 
the other property, and again divided into four equal size groups. 
This ensures similar statistical errors in all bins. We repeat this pro- 
cesses by sorting based on Av and L^Ha- 

Figure 20, left column, presents the derived relations of the 
mean values of the three SED parameters as a function of /, at a 
fixed m, for vLv(FUV)/L(,Ha, and as a function of ra at a fixed / for 
Lx/LbHa and for ccpuv.x- Error bars denote the error in the mean. 
The upper panels show that vLv(FUV)/LbHc( is largely set by /, and 
is independent of m. The drop of vLv(FUV)/LbHa steepens with /, 
with a drop mostly at / < — 1.3. The right hand column of Fig. 20 
presents vLv(FUV)/LbHa as a function of Av for different bins in 
L(,Ha- Both parameters contribute, as vLv(FUV)/LbHa decreases 
with increasing Av, at a fixed L^Ha, and also with decreasing L^Ha, 
at a fixed Av. The comparison of the upper two panels indicates 
that / is the main driver of the vLv(FUV)/LbHc( distribution, and 
the trends with Av and LbHa are driven by the / dependence. 

As discussed above, the tail towards lower vLv(FUV)/LbHc( 
values (Figs. 15, 17) is likely due to reddening. The mean 
vLv(FUV)/LbHc( = 18 at Z = -2 (Fig. 20) may result from an in- 
creased amount of reddening with decreasing L/L^dd^ in particular 
close to L/L^dd = —2. A trend of vLv(FUV)/LbHa with / is consis- 
tent with the trend described above of an increasing reddening with 
the host/ AGN luminosity ratio. 

Figure 20, middle row panels, explore the driving parame- 
ter of Lx/LbHa- Here there are mixed results. There is a general 
trend of decreasing Lx/LbHa with increasing m, but with some / 
dependence of the amplitude for m > 7.5, where a higher I im- 
plies a lower Lx/LbHa- There is also a general trend of decreasing 
Lx/LbHa with increasing LbHa, with a weaker trend of decreasing 
^x/^bHa with increasing Av. None of the four parameters appear 
to be a dominant parameter 

Figure 20, lower row panels, explore the driving parameter of 
apuv.x- The slope gets steeper with increasing m and with increas- 
ing I, and both parameters contribute similarly to the trends. The 
right hand panel shows that 0!puv,x is mostly driven by LbHa- There 
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is only a weak dependence of OJpuv.x on Av. This indicates tiiat tiie 
dependence of Ofpuv.x on Mbh and L/Ledd found by Kelly et al. 
(2008) is probably due to the correlation between these parameters 
and AGN luminosity. 



4 DISCUSSION 

4.1 The distribution of Av 

Croom (20 11) already noted that the mean value of Av of the H/3 
line in the SDSS quasar samples is independent of luminosity. Here 
we extend this result and show that the distribution of Av values is 
also independent of luminosity (Fig. 7). Why is this result surpris- 
ing? The observed value of Av is determined by the AGN luminos- 
ity, which sets the radius of the BLR, and by Mbh, which sets the 
velocity dispersion at that radius. Thus, the distribution of Av value, 
/(Av), is set by the AGN black hole mass function, /(Mbh), and by 
the AGN luminosity function, /(Lbol), which may vary with Mbh- 
The observed /(Av) is also set by the detection efficiency of indi- 
vidual broad line AGN. As shown above, selection effects are not 
expected to dominate /(Av) (Fig. 6, upper panel), thus the lack of 
dependence of /(Av) on LbHa appears to be an intrinsic property 

of the SDSS Tl sample. At face value, the physical implications 

1/2 
are that the mean Mbh increases roughly as L^^^ , and similarly the 

mean L/LEdd also scales as L^^^ (Fig. 6 lower panel). These rela- 
tions reflect the simple fact that the distribution of the objects along 
the L(,oi axis and along the Av axis are independent of each other 

The only significant deviations are the rise in the lower limit of Av 

1/4 
With Lji^jj , consistent with the physical Eddington limit, i.e. Z = 0, 

— 1/4 
and the drop in the upper limits on Av with Z^^j , consistent with 

an upper limit to Mbh of a few 10^ Mq. The convergence of these 

upper and lower limits on Av at L^^x > a few 10 erg s^ leads 

to a narrower /(Av), which produces a small bump in /(Av) at 

^^1000 ~ 7 (Fig. 7). A similar convergence of the range of FWHM 

for the Mg II and C IV lines at the highest continuum luminosities 

was noted by Fine et al. (2008, 2010). 

Could there be another physical effect which controls the form 

of /(Av), other than the the black hole mass function and the AGN 

luminosity function? Laor & Davis (201 1) noted that the accretion 

disc continuum becomes colder with increasing Av, and for a large 

enough value (depending on the black hole spin), the accretion disc 

can become non-ionizing, and thus form a lineless quasar which is 

harder to detect. This may explain the rarity of objects with Av > 

10, 000 — 20, 000 km s^ ' . However, it cannot explain the similarity 

of/(Av) atAv< 10, 000 km s^', which therefore remains an open 

question. 



4.2 The SED 

4.2.1 A uniform blue optical- UV SED 

As described above, at LbHa > 10"*^ erg s^' the net AGN SED 
in the 9000A-1500A range is consistent with a fixed shape, and 
an amplitude which scales linearly with LbHa- The SED shape is 
consistent with the R06 SED of luminous AGN. Furthermore, the 
linearity of the mean LbHa with vLy in the optical-UV range sug- 
gests that the fixed shape SED extends to the ionizing continuum 
as well. The linearity breaks in the X-ray regime, where the relative 
X-ray emission drops with increasing luminosity (Fig. 20, and var- 
ious earlier studies, e.g. Just et al. 2007). Other studies (Maiolino 



et al. 2007; Treister et al. 2008) indicate there is also a non linear- 
ity in the mid-IR emission as well, which also decreases in relative 
strength with increasing luminosity. 

The dispersion in the SED shapes, as indicated by the ob- 
served distribution of vLv(UV)/LbHa and a^pi values (Fig. 15), 
is driven at lower luminosities by the range of host contributions to 
the SED. The luminous AGN (logLbHa = 43.5 bin) show a signif- 
icantly smaller dispersion. Furthermore, for — 1 < Oopt < 0, within 
the narrow Gaussian peak of the distribution, there is a correlation 
of Oopt with vLv(UV)/LbHa, consistent with reddening (Fig. 17). 
This suggests that the intrinsic dispersion in ttopt, after correction 
for reddening, is smaller than observed. These results are consis- 
tent with earlier suggestions (Ward et al. 1987; Gaskell et al. 2004) 
that AGN have a single shape SED over a large luminosity range, 
which is affected by dust and host contribution. 

A rather uniform and blue intrinsic optical-UV SED is ex- 
pected if AGN are powered by a thin accretion disc, which emits 
locally close to a blackbody. A locally blackbody Newtonian accre- 
tion disc produces the well known Ly °= v'' SED at wavelengths 
emitted from intermediate regions in the disc, away from the in- 
ner and outer edges of the disc. Including non-LTE effects in the 
disc atmospheric emission, and the relativistic corrections for the 
disc structure and for the observed emission, leads to an optical-UV 
slopes somewhat steeper than v'' ^, but still bluer than the average 
observed slope (e.g. Bonning et al. 2007). Davis et al. (2007) found 
that a small amount of reddening of L(S — V) ~ 0.03 — 0.055 of 
SMC-like dust, is sufficient to make the observed UV SED con- 
sistent with the accretion disc model predictions. These values are 
consistent with a mean E{B — V) ~ 0.04 for SMC-like dust, de- 
duced above (§3.7.3) by comparing the slope of the 10 percentile 
of flattest luminous AGN ttopt = —0.13, with the peak of the narrow 
distribution at ftopt = —0.3, where 60% of the AGN reside (Table 
4). 



4.2.2 The reddening dust 

What is the fraction of AGN affected by reddening? As noted 
above, in the most luminous AGN bin (logLbHa = 43.5), about 
60% of the AGN may be slightly affected by reddening with 
E{B — V) ~ 0.04, as indicated by the correlation of Oopt vs. 
vLv(UV)/LbHa, consistent with dust extinction affecting the SED 
at -1 < Oopt < (Fig. 17). An additional 30% of these AGN, have 
a mean Oopt = — 0.7 (Table 4), which may correspond to a red- 
dening of £(S — K) ~ 0.1 (§3.7.3), and the remaining 10% which 
reside in the cCopt = —1.1 bin may be affected by a reddening of 
E{B — V) ^ 0.2 (note that 6 more objects with aopt ~ and 4 with 
Ofopt < —1.25 are not included in Table 4, due to our N ^ 10 re- 
quirement, §3.7.2). The dust suppresses vLv(FUV) by a factor of 
two and five in the ofopt = —0.7 and Oopt = — 1.1 bins, indicating 
significant reprocessing of Lboi by the dust. 

The distribution of extinction values appears independent of 
the AGN luminosity for logLboi = 43 — 46, as the tail of the 
vLv(UV)/LbHa distribution remains largely unchanged with LbHa 
(fig. 15, left panel). However, for the quasars sample, Richards et al. 
(2003) find that 6% of the quasars appear to have L(S - V) ~ 0. 13, 
while Hopkins et al. (2004) find that only 2% of the SDSS quasars 
have E{B -V) > 0.1 and 1% have E{B -V) > 0.2, significantly 
lower fractions than found here (but note that some of it may be 
due to different procedures used to derive E{B — V)). This sug- 
gests a decreasing reddening with increasing Lboi, which may be 
related to the observed drop in the mid-IR to Lboi with increasing 
Lbol in AGN (Maiolino et al. 2007; Treister et al. 2008) suggesting 
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Figure 20. The FUV, X-ray, and broad Ha emission ratios as a function of m and / (left column), and as a function of LbHa and Av (riglit column), at the LbHa 
luminosity range where the host contribution is negligible. All bins in each panel have the same number of objects (lower left comer). In search of the primary 
parameter which drives a given ratio, we select for the x-axis either m or / (left column), and Z.(,Ha or Av (right column) to create the smallest spread for the 
different curves, where each curve includes Tl objects with a specific value of the other parameter Error bars denote the error in the mean value in the bin. 
To reduce non-detection biases in the X-ray, in the two lower rows only FbHa > 10"'^ ' erg s"' cm^^ objects are used. (a,b) Note that the primary parameter 
which sets the mean vLv(FUV)/LbHa is /■ The ratio is independent of Mbh- Binning by LbHa and Av produces a larger dispersion. (c,d) Here either m or LbHa 
are the dominant parameters. (e,f) apuv.x. equivalent to the ratio of the upper two ratios. Here LbHa appears to be the dominant parameter 



a drop in the covering factor of dusty gas with increasing lumi- 
nosity. However, the mid-IR emission may be associated with the 
'receding torus' scenario (Lawrence & Elvis 1982) where the as- 
sociated dust is X-ray absorbing and thus optically very thick. The 
dust associated with the obscuring torus will therefore not be the 
reddening dust. Also, as noted above, the Tl sample does not show 
a luminosity dependence for the reddened fraction of the objects. 

An alternative interpretation of the lower fraction of reddened 
AGN in the Richard et al. and Hopkins et al. quasar samples, is 
a relation of reddening with L/L^^^. As discussed above, quasars 
have a higher L/L^^^ by selection (Fig. 8) compared to emission 
line selected Tl sample, and higher L/L^^i AGN tend to have lower 
reddening (Fig. 20). 

What is the nature of the dust? Richards et al. (2003) and Hop- 
kins et al. (2004) find that the SED of their red quasars is well 
reproduced, down to ~ 1500A, by extinction of SMC-like dust. A 
similar conclusion is reached here based on Oiopt (Fig- 17). The local 
UV slope argues against SMC-like dust, but given the large scatter 



and the lack of spectroscopy in the UV, the discrepancy may not be 
significant. 

Where is the dust located? The location of the reddening dust 
can be explored by comparing the near, mid and far-IR emission of 
reddened and unreddened objects, which can tell how far from the 
center the bulk of the reddening dust resides. The possible relation 
of the reddening with the host/ AGN luminosity ratio, or (equiva- 
lently) withL/LEdd> quantities which are not inclination dependent, 
suggests the reddening dust does not reside on the smallest possible 
scales. 



4.3 The host galaxy 

4.3.1 Lhost V.S. Lagn 

In §3.5.1, we use the z band host galaxy luminosity as a measure 
of the host stellar mass (within the SDSS aperture). We find that 
the correlation of the Tl host luminosity with the AGN luminos- 
ity (LbHa) is induced by the correlation of both quantities with z- 
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Partial correlation analysis of the host luminosity versus LbHa> for 
a fixed z, yields no significant correlation. A similar result was ob- 
tained by Hao et al. (2005b) for emission line selected type 1 and 
type 2 AGN from the SDSS. It reflects, as noted by Hao et al., the 
large range of L/LEdd in emission line selected AGN samples, in 
contrast with colour and quasi-stellar selected AGN samples, which 
by selection have a high L/L^dd (s-g- §3.1.2), which induces a cor- 
relation between the AGN and the host luminosity. 

4.3.2 The host type 

Kauffmann et al. (2003a, hereafter K03) compared the host prop- 
erties of type 2 AGN to inactive galaxies in the SDSS survey, and 
found that type 2 AGN reside almost exclusively in massive galax- 
ies. Here, we verify this effect, and find that at low z the mean z 
band host luminosity of type 1 and type 2 AGN, are a factor of 
three to five higher than in inactive galaxies. We further separated 
the inactive galaxies to NEG and SFG, and find that at the lowest 
z, the z band luminosity of Tl AGN hosts follow very closely the 
mean NEG luminosity. So, it appears that the preference of AGN 
for more massive hosts may just reflect the preference for NEG, 
and the fact that NEG tend to be more massive. 

Eurther support for NEG as the host of the Tl AGN comes 
from a comparison of the host luminosity distributions. The NEG 
and the Tl hosts follow similar (fibre r-band) luminosity distribu- 
tion. The relative normalization of Tl AGN is ~ 3% of the NEG. 
This result is consistent with a scenario where ~ 3% of NEG host 
a Tl AGN, irrespective of the NEG luminosity. The similar stellar 
mass distribution of NEG and of type 2 AGN is also noted in Salim 
et al. (2007, Fig. 17 there). In contrast, if Tl occur in SEG, then 
the fraction of SEG hosting a Tl AGN increases from ^ 10^^ to 
~ 1 over the observed range of luminosity (Eig. 9). Such a drastic 
change in the AGN fraction with the host luminosity appears im- 
plausible. Furthermore, it contradicts the fact that the Tl hosts are 
more massive than SFG (Fig. 12, see also Salim et al. 2007). 

With increasing luminosity we find that the hosts of AGN 
become bluer, as was first noted for quasars (Hutchings 1987; 
Kirhakos et al. 1999; Jahnke et al. 2004). This was verified for type 
2 AGN by K03 who found an increasing fraction of younger stars 
with increasing luminosity. A similar trend was noted by Vanden 
Berk et al. (2006) for type 1 SDSS AGN, although the non orthog- 
onality of the host and AGN eigenvectors leads to a non-unique 
solution in the method used in that study (see §2.2). Here we find 
that the Tl and type 2 hosts are similar, and with increasing L\)\ia^ 
the mean host u — z colour for both types gets bluer (Fig. 12). For 
LbHa < 10 erg s^ , the Tl host colour is red, identical to that of 
NEG, and for L^Ha > 10'" erg s^ ' , it becomes bluer, becoming in- 
termediate between NEG and SFG at LbHa si 3 x 10"*^ erg s^ ' . This 
colour effect is also characterized by the tendency of AGN (de- 
pending on the sample selection criteria, e.g. Hickox et al. 2009) 
to reside in the 'green valley' in the colour-luminosity diagrams 
(e.g. Fig. 1 in Salim et al. 2007). The implied specific SER ra- 
tios of 10^ < Z7300 < 10^ (Table 3) are significantly lower 
than for 'starburst' galaxies, i.e. galaxies forming stars faster than 
their mean SER over the Hubble time (log ^300 > —1.5). The high- 
est luminosity Tl AGN are intermediate between NEG and SFG, 
as found in type 2 AGN (Heckman et al. 2004; Wild et al. 2007). 
However, their concentration index and bulge/total light ratio (K03) 
remains similar to NEG, indicating early type hosts. 

Why do low luminosity AGN avoid SFG? Is it due to sup- 
pression of the host SER? This scenario is unlikely given the low 
AGN luminosity, which is well below the host luminosity, and thus 



unlikely to affect the host significantly. In fact, as Lboi increases 
and the AGN dominates the host emission, the host SER actually 
increases, the reverse of what is expected from the AGN SER sup- 
pression scenario. Furthermore, the suppression scenario is incon- 
sistent with the various AGN host galaxy parameters (stellar mass, 
concentration index, bulge/total light ratio) which indicate AGN 
occur in bulge dominated galaxies (see K03), in contrast with the 
average galaxy parameters of SFG which characterize disc domi- 
nated galaxies (Kauffmann et al. 2003b). A similar preference for 
earlier type hosts was also found by Ho et al. (1997b) for the low- 
est luminosity AGN (LbHa '^ 10 srg s^ ), for luminous quasars 
(e.g. McLeod & Rieke 1995), and for X-ray selected AGN (e.g. 
Schade et al. 2000). The early type host galaxy parameters also ar- 
gue against a delay scenario, where the host SF episode shuts off 
by the time the infalling gas reaches the AGN. Massive stars will 
indeed die off within ~ 100 Myr, but the structure of the galaxy 
will take longer to evolve. Since low luminosity AGN and SFG ap- 
pear to have different host mass distribution properties, they are not 
likely to be different phases in the same host population. 

An alternative explanation is that AGN avoid SFG because 
SFG tend to have lower mass bulges. If SFG at low z typically 
have Mbh ^ 10 Mq, yet rather luminous discs, the AGN may 
be optically swamped by the host, even if it shines close to LEdd- 
Another option is obscuration by dust, as high SER is generally as- 
sociated with excess dust extinction. For example, Simoes Lopes et 
al. (2007) find excess dust even in early type galaxies with AGN 
activity. To test whether AGN avoid SFG because of their high 
SER and the associated dust, or because of their low bulge mass, 
one needs to compare the fraction of SFG and the fraction of NEG 
galaxies which host an AGN, at the same bulge mass, based on X- 
ray and mid-IR surveys which are less affected by dust extinction. 
If the fraction is similar, then the SF activity has no effect on the 
AGN activity. 

One should note that the B04 definition of an inactive galaxy 
as SFG or NEG is not physically motivated, but is just based on the 
ability to detect and measure a set of emission lines. It will clearly 
be useful to compare the Tl host properties with inactive galax- 
ies separated based on a physical property, such as the absolute or 
specific SER. 



4.3.3 The SFR values 

The u band luminosity is mostly set by the SFR in the recent ~ 100 
Myr. At Lboi ~ 5 x 10 erg s^' the Tl hosts u band luminosity 
is intermediate between SFG and NEG (Fig. 12), which suggests 
an intermediate SFR. A similar intermediate SFR was found by 
Salim et al. (2007, Fig. 3 there) for the SDSS type 2 AGN, based on 
the UV luminosity from GALEX (see also Heckman et al. 2004). 
The value of the SFR can be estimated from the u band luminosity 
(Moustakas et al. 2006, eq. II). The mean host u band luminosity 
of the Tl AGN then implies a SFR which increases with the AGN 
Lboi. The SFR increases from O.Il Mq yr^ at log Lboi = 42.2 
to 3.5 Mq yr"' at log Lboi = 44.6, with a mean SFR of ~ 1.1 
Mq yr^' for the most abundant log Lboi of 43.7 (derived from 
^bHa ~ 41.6, the most abundant Tl AGN, Fig. 5). The rise in the 
host SFR with Lboi is qualitatively consistent with earlier results for 
type 2 AGN (K03, Silverman et al. 2009, Netzer 2009). The values 
derived above are comparable to the SFR vs. Lboi values presented 
by Trakhtenbrot & Netzer (2010, Fig. 1 there) for SDSS type 2 
AGN, where the SFR is derived from the D4000 (B04), and Lboi is 
derived from the [O III] and [O I] luminosities. Thus, we conclude 
that inferring Lboi from either the NLR ([O III]) or the BLR (Ha), 



and inferring the SFR from eitiier the u band luminosity or from 
D4000. leads to similar Lboi versus SFR relations for type 2 and 
type 1 AGN. 

An estimate of the SFR in type 1 AGN was made by Kim et 
al. (2006), based on the [O II] luminosity, assuming 1/3 comes from 
star formation, and 2/3 is powered by the AGN. They used the Kew- 
ley et al. (2004) [O II] calibration of the SFR and derived a SFR for 
the most abundant Lpnj = 5 x 10'*'' erg s^' of 0.5 Mq yr^', for 
an SDSS based sample of z < 0.3 type 1 AGN, which should over- 
lap well the Tl sample. Kim et al. note the [O II] emission may all 
be powered by the AGN, so the actual SFR could be even smaller. 
Note that the [O II] based SFR values for the SDSS type 2 AGN, 
presented by Silverman et al. (2009, Fig. 10 there), is typically in 
the range of 0. 1 — 1 Mq yr^ ' , which is significantly below the UV 
and the Ha based estimate for type 2 SDSS AGN, of 1 - 10 Mq 
yr^' (Salim et al. 2007, Fig. 3 there). So, apparently there is a dis- 
crepancy between the u/UV and the [O II] derived SFR in type 2 
AGN (see discussion in Ho 2005), which may also be present in 
the Tl sample, if [O II] is mostly powered by the AGN, and the im- 
plied SFR should be lower. Since [O II] measures the most recent 
SFR compared to the UV/m continuum, the discrepancy may imply 
a post starburst. 



5 CONCLUSION 

We present and analyze a new sample (Tl) of 3 579 broad Ha 
selected AGN from the SDSS DR7, with logLbHa = 40 - 44, 
which spans m = 6 - 9 and Z = -3 - 0. We add UV (GALEX), 
IR (2MASS), and X-ray (ROSAT) luminosities to form the mean 
SED. The main results are: 

(i) The Ha FWHM velocity distribution dA'/dlogAv is independent 
of luminosity and falls exponentially with Av. The origin of this 
distribution remains to be understood. 

(ii) The observed mean 9000A-1500A SED, as a function of Lbea^ is 
consistent with a sum of the mean SED of luminous quasars, which 
scales linearly with LbHa> and a host galaxy contribution. 

(iii) The host galaxy r-band luminosity function of Tl objects with an 
extended morphology follows the NEG luminosity function, with a 
relative normalization of ^ 3%, suggesting that the host of broad 
line AGN are NEG, and the AGN probability of occurrence is in- 
dependent of the host mass. 

(iv) The mean z band luminosity and the u — z band colour of the low- 
est luminosity Tl host is identical to NEG. The host colour be- 
comes progressively bluer with increasing luminosity. The implied 
mean SFR versus Lboi is similar to that found in type 2 AGN. 
(v) The dispersion in the optical-UV SED in luminous AGN 
(logLbHa ^ 43), is consistent with reddening. This indicates the 
intrinsic SED of AGN is blue, with a small dispersion, as predicted 
from thermal thin accretion disc models. 

(vi) Reddening by dust along the line of sight to Tl AGN is common 

(40%with£(S-V)>0.1). 
(vii) The LbHa versus Lx correlation provides a useful probe for unob- 
scured narrow line AGN. It can be used to test if the absence of a 
broad Ha, in X-ray detected AGN, is significant, 
(viii) The primary parameter which drives apuv.x is the luminosity, 
rather than Mge or L/L^dd- 

(ix) The primary parameter which drives log vLv(FUV)/log L^Ha 
is L/LEdd> which may indicate that lower L/Ledd AGN are more 
likely to be reddened. 
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Figure Al. An example of host subtraction using the Yip04 eigenspectra 
(§2.2). The top Une is the SDSS spectram, after coiTecting for Galactic ex- 
tinction. The bottom line is the eigenspectra-subtracted spectrum. Note the 
high contrast of the broad Ha in the residual. 
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Figure A2. Identifying broad Ha emission (§2.3), shown on an example 
host-subtracted spectrum. The continuum (solid straight line) is interpolated 
from the mean of the regions bordered by dash-dot lines. The potential BLR 
flux (AF) is filled with gray. Flux near the strong narrow lines (dehmited by 
dotted hnes) does not enter the calculation of AF. 
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Appendices 



APPENDIX A: SAMPLE CREATION DETAILS 

Al Bad pixels 

During the initial filtering of the SDSS database (§2.1), we require 
each spectrum to comply with the following criteria, concerning 
'bad' pixels. A pixel is considered as bad if it has any of the flags 
listed in table 1 1 of Stoughton et al. (2002), except 'MANYBAD- 
COLUMNS' and 'NEARBADPIXEL'. Also, due to SDSS over- 
sampling, each pixel within two pixels of a flagged pixel is consid- 
ered bad. 

• < 20% bad pixels near the broad Ha (6250A-6880A) 

• bad pixels near the narrow Ha (6530A-6600A) 

• < 50% bad pixels in the continuum fitting ranges (6125A- 
6250A & 6880A-7000A) 



A2 Host subtraction 

In §2.2, each spectrum is fit with three galaxy eigenspectra (Yip 
et al. 2004) and a power-law component. The fit is performed 
on the following wavelength ranges, chosen since they are de- 
void of the strong AGN and stellar emission lines: 3455A- 
3490A, 3500A-3550A, 3600A-3700A, 4200A-4250A, 4400A- 
4460A, 4490A-4600A, 5130A-5680A, 5750A-5830A, 5920A- 
6050A, and 6130A-6200A. 

An example of a spectrum before and after subtracting the fit 
of the three eigenspectra is shown in Figure Al. 



A3 Broad line flux 

A depiction of the excess flux identification process presented in 
§2.3 is shown in Figure A2. 
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Figure A3. The distribution of the normalized excess flux (§2.3) in the par- 
ent sample (232 837 objects). Normalization is by the Fi dispersion (fJdata) 
in the continuum windows (Fig. A2). Bin size is 0.1. To avoid objects with 
excess flux due to the nornial dispersion (see Gaussian fit to the distribu- 
tion), only the 6 986 objects with AF/fJdata > 2.5 passed to the next stage 
of the algorithm. Also shown is the final Tl sample, the 3 579 objects that 
pass the final criteria in §2.5. 



Figure A3 shows the AF/cTdata histogram of the parent sam- 
ple (§2.1). Near AF/cTdata ~ 0, the distribution fits a Gaussian, as 
expected from the central limit theorem. The objects in the positive 
wing are potential Tl objects. We choose a criterion of AF/ojata > 
2.5, since objects with lower AF/cTdata have a < 1% chance to pass 
the additional criteria described in §2.5. The AF/ojata distribution 
of the Tl sample, after the criteria in §2.5, is also shown. It can 
be seen that a criterion of AF/Odata ^ 12 would have been roughly 
sufficient to avoid false identifications. 
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A4 Broad Ha and narrow lines fit 

The emission lines are fit with Gauss-Hennites functions (GHs) in 
three stages (§2.4). The details of tliis process are described below. 
We use the conventions and notations of van der Marel & Franx 
(1993): 7, V, CT and h' are the flux, mean velocity, dispersion, and 
i* Hermite coefficient, respectively. Fourth-order GHs have finite 
h and h , while the other h' are zero. A line name in subscript 
denotes a property of a specific line. The SDSS flux density error 
at A is denoted by Ej^ . 



A4.1 [OIII] 

If there are < 20% bad pixels at 4967A-5250A, the range is fit with 
three components: 

• An iron template derived from observations of I Zw 1 (kindly 
provided by T Boroson). We fit its flux, velocity, and width (by 
convolving with a Gaussian), denoted ypeu, Vpen and CTpeii, re- 
spectively. The value of ype ii is initialized from the power-law co- 
efficient found in §2.2, assuming a Fe II equivalent width of 7A 
in the fit range. During the fit we keep [Vpe n| < 550 km s^' and 
CTFeii < 20000 km s"'. 

• A broken power law for the underlying continuum and the red 
wing of the broad Hj3. Initialization is as follows: the wavelength 
and flux at the break - 5082A, F;i(5082A); shorter X slope - in- 
teipolation between F;^ (4967 A) and F;^(5082A); longer A slope - 
0. 

• A 4*-order GH for [O III]. The y^o 
initialized from the SDSS fit. The values of k 



) III] ' %) III] 



and C7ro III] are 

'[O III] and /7[o juj are 

initialized to 0. During the fit we keep |V[o iii] I < 550 km s^' and 



FWHMr, 



< 1 400 km s . The fit is run once with hf. 



and 
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/lij jjj, fixed to (a Gaussian), and once when they are allowed to 
vary. The final fit is chosen according to an F-test (p < 0.05). 

The fit is disregarded if there are bad pixels < 6A from the 
fit [O III] peak. The [O III] is considered 'detected' if it has F^ > 
3.5 X £[0 III] , and V^q ^q has not reached the limit mentioned above. 



A4.2 Ha region 

We use nine components to fit the 6250A-6880A region: 

• A continuum. 

• An up to 10*-order GH for the broad Ha (bHa). 

• Seven up to 4*-order GHs for the narrow lines (NLs). If no 
good pixel is found within SA of the rest wavelength of a NL, it is 
not fit. Below, a subscript 'NL' refers to all seven NLs simultane- 
ously. 

Initially, the broad Ha is fit by a 4*-order GH, in order to avoid 
fitting the narrow lines by high-order Hermite polynomials. The 
mean velocities of NLs in a doublet are always kept equal, and the 
flux ratios in the [N II] and [O I] doublets are fixed to 3. We limit 
—550 < Vnl < 550 km s^ . The Levenberg-Marquardt best-fitting 
algorithm (Press et al. 1992) is used. 

Several fit attempts are made, differing by the initialization 
and attainable values of the parameters: 

A. Initialization of the YbHa^ <^feHa and continuum is based on 
the continuum and excess flux found in §2.3. The VbHce and 
all /j(,jj„ are initialized to 0. The Vnl,Onl, ^^nl and h'^^ are 
initialized by the fit properties of [O III]. The tj^l are initialized 



using NL flux ratios of typical Sevferta^'i I^H . During the fit, 
Cnl, h^i^ and /jjj^ are kept fixed to the [O III] values. 

B. Similar to A, but starting from the result of A and allowing all 
Onl to vary together (one additional degree of freedom). We 

limit fTNL< l-5xcT[Oi,ij3- 

C. In this attempt we avoid implausibly large troughs in the broad 
Ha profile, which can be created by the [N II] and nHa compo- 
nents. The maximum F;^ of these 3 NLs are limited such that the 
broad profile will not have a trough deeper than 20% of its local 
F)^. Also, the error entering the fit near these NLs (±IOA) is re- 
duced to a 1/3 of its initial value, effectively improving the fit of 
this region on the expense of regions farther from Ha. The bHa 
and continuum are initialized as in A. The tJvjl, Vnl and cTnl 
are initialized from the result of B, keeping CTnl < 1.5 x cT[o m] 
during the fit. The /j^L'^^nl ar^ initialized to 0, and their abso- 
lute value is kept below h^Q j^, and h^Q m, , respectively. During 
the fit, Onl> ''nl a'^d ''nl °^ different NLs are kept equal. 

D. The bHa and continuum are initialized as in A. The ^nl 
and Vnl are initialized from the highest-F;^ pixel within 5A of 
the rest wavelength of the line. The C7nl is initialized so that 
FWHMnl = 300 km s"' and limited to < 1 .5 x qo m] . As be- 
fore, all (7nl vary together. The h^^ and h'^i^ are fixed to 0. 

The X of the B, C and D fits are compared, using the sum of 
the SDSS flux density error and 10% of the fit flux density as the 
error, and the best result is chosen. Objects in which (TnHa reached 
the limit (1 .5 x (7[o m] ) are found to have non- or barely-detectable 
NLs near Ha. When the narrow Ha component is not restricted 
by other NLs, it erroneously fits the top of the broad Ha. In such 
cases, we use result A, in which CTnl = <^[0 iii] • In the 45 of these 
objects that were fit with F„iia/F[o ni] > 2, the [O III] is weak as 
well, and the problem remains. Therefore, in these 45 objects we 
assume F„Ha,f[N II] =0. 

Finally, in order to fully account for the diversity of the broad 
Ha profile, we run fits with a high-order GH for the broad Ha. The 
initial values of the parameters are taken from the chosen fit above, 
while the new h' are initialized to 0. Only the parameters of the 
broad Ha and the y^h'^ vary during the fit. First, a fit is attempted 
with a 6 -order GH, using hi & /i -j L Then we add h^ & h(„ and 
lastly /j7 & /jg . Additional Hermite coefficients are retained only if 
they pass an F-test (p < 0.05). To avoid having negative or distinct 
features in the broad Ha profile, the broad Ha flux density is set to 
zero at velocities beyond a velocity in which it is < 10% of the flux 
density error. 

After this process, 209 objects (3%) still have a reduced x^ > 
2. By eye-inspection, 80 of them have a broad Ha and a reasonable 
fit. They were added to the Tl sample. Also, 35 show a BLR which 
was improperly fit. Since their number is relatively small, we do 
not attempt any further improvement of their fit, and they do not 



r[Nll]6583A-rnHc<,r[Ol]6300A'r[Si,]673lA'r[Sii]67I6A are initialized to 
1/3, 1/3, 1/20, 1/9, 1/9 of 7[oiii], respectively. 

If [O III] is not detected (0.6% of TI sample), ^l initialization is de- 
rived from the upper limit on 7[o mj ■ If bad pixels prevented fitting [O III] 
(4% of Tl objects), ^l and Vnl are initialized from the SDSS fits. In both 
cases cTnLi'^nl ^^'^ ''nl ^^ initialized to a net Gaussian with FWHIVI = 
300 km s"'. 

'^ If [0 III] is not detected, B is not ran. Instead, A is ran with CTnl allowed 
to \3iy, keeping FWHIVInl < 650 km s^ ' . 

When V and 7 of a GH ai'e allowed to vary, hi & /12 hold higher-order 
information than h^ & h'* (van der IVIarel & Franx 1993). 
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Object Name 


AF/ffdata 


Rejection reason 


J000202.71-010508.8 


2.5 


Aviooo > 25 


J000500.03+002055.1 


2.5 


Aviooo > 25 


J000605.59-092007.0 


2.9 


Af '/ffdala < 4 


J000834.72+0()3 156.1 


11.3 


Af '/ffdata < 4 


J000911.58-003654.7 


14.2 


Fi,ma/El < 2 



Table Bl. Objects with excess flux near Ha that were rejected from the Tl 
sample. Col. 2 lists the normalized excess flux found in §2.3. The criterion 
which the object failed (§2.5) is listed in Col. 3. The electronic version of 
the paper includes all 3 407 objects. 



enter the Tl sample. The rest of the high x objects do not show a 
clear broad Ha. 



A4.3 Hp 

Finally, we fit the region ±6 C7nHa from the H/3 peak, using: 

• A 4 -order GH for the narrow Hj8 . Only y^jjn is fit, initialized 
to 7nHa/3. Other GH parameters equal those fit to Ha. 

• A parabola for the continuum + top of the broad H/3. Initial- 
ization is by the edges and middle of the fit range. The non-linear 
coefficient of the parabola is non-zero only if necessary (F-test, 
p < 0.05). 

The fit is not performed if there are bad pixels less than 4 x CTnHa 
from the peak. 



APPENDIX B: REJECTED OBJECTS 

Table Bl lists objects which passed the selection criteria described 
in §2.1 and §2.3, but failed one of the selection criteria described 
in §2.5. 



